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COVERAGE OF JOURNAL 


THE aim of this journal of abstracts is to give complete coverage of papers in the field of statistical 
theory and new contributions to statistical method. Papers which report only applications or examples 
of existing statistical theory and method will not be included. There are approximately two hundred 
and fifty journals published in various parts of the world which are wholly or partly devoted to 
the field of statistical theory and method and which will be brought within the scope of this journal of 
abstracts. In due course it will be possible to issue a complete list of these journals. In the case of the 
following journals, however, being those which are wholly devoted to statistical theory—all contributions, 
whether a paper, note or miscellanea, will be abstracted : 

Annals of Mathematical Statistics, 

Biometrika 

Journal, Royal Statistical Society (Series B) 

Bulletin of Mathematical Statistics 

Annals, Institute of Statistical Mathematics 
Within the larger group of journals, which are not wholly devoted to statistical theory and method, there 
are some journals which have the vast majority of their contributions in this field. These journals, there- 
fore, will be abstracted on a virtually complete basis : 

Biometrics 

Metrika 

Metron 

Review, International Statistical Institute 

_ Technometrics 

Sankhya 
After experience in the publication of this journal of abstracts it may be found desirable to add further 
journals to these lists. In any case readers may be assured that all papers properly to be included in this 
journal of abstracts will be included irrespective of such notification. If any reader of this journal discovers 
a paper which happens to have been overlooked, the General Editor will be pleased to be informed so 
that the appropriate abstract can be made: always provided that the date of publication is after 1st 
October 1958, when the abstracting for this journal commenced. 

In addition to the ordinary journals, there are two kinds of publication which fall within the scope 
of this journal of abstracts. They are the experiment and other research station reports—which occur 
particularly in the North American region—and the reports of conferences, symposia and seminars. 
Whilst these latter may be included in the book review sections of journals it is unusual for any individual 
contribution to be noted at any length. These publications are, in effect, special collections of papers 
and for this reason the appropriate arrangements will be made for them to be included in this journal. 
By the same token, abstracts of papers given at conferences and reproduced in an appropriate journal will 
be disregarded until the definitive publication is available. 


FORMAT OF JOURNAL 

The abstracts will be up to 400 words long—the recommendation of UNESCO for the “long ” 
abstract service: they will be in the English language although the language of the original paper will 
be indicated on the abstract together with the name of the abstractor. In addition, the address of the 
author(s) will be given in sufficient detail to facilitate contact in order to obtain further detail or request 


an offprint. Suitable indexes will be provided annually in a supplement. 
iii 


The scheme of classification has been developed upon lines that will facilitate the transfer to punched 
cards of the code numbers allocated to each abstract: to allow for future development it is suggested that use 
is made of 4-column fields. Each abstract will have two classification numbers: the primary number in 
heavy type to indicate the basic topic of the paper and the secondary number in brackets to take account of 
the most important cross-reference. The sheets of the journal are colour-coded according to the twelve main 
sections of the classification and it should be noted that it is the main section number of the primary 
classification which determines the colour code for each abstract. It is believed that this method of 
colour-coding the pages will provide a distinctive visual aid to the identification of abstracts both when 
the journal is in bound form or dismantled and filed on cards or in binders. The format and simple 
binding allows of the following alternative treatments by users of the journal : 

(a) Leave intact as a shelf-periodical. 

(b) Split and filed in page form according to the main sections of the classification. 

(c) Split and guillotine (single cut) each page ready for : 

(i) pasting onto standard index cards. 
(ii) filing in loose-leaf or other binders—for which the appropriate holes are punched. 


It will also be possible for those users who need a completely referenced file to insert skeleton cards or 
sheets according to the secondary classification number provided on each abstract. 
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LIST OF PUBLICATIONS FOR REGULAR SCANNING 


The following list of publications in the statistical and allied fields will be examined regularly 
for papers which should be abstracted for this Journal. Other journals will also be scrutinised 
from time to time and a check maintained by scanning other relevant abstracting journals. 
In addition, abstracts will be included of special collections of papers as published in reports 
of conferences, symposia and seminars together with the published reports of experiment and 


other research stations. 


Acta Acad. Cient., Lima 
Acta Math., Stockh. 

Acta Math. Acad. Sci. Hung. 
Acta Math. Sinica 

Acta Pontif. Acad. Sci. 
Advanced Management 
Advance. Enzymol. 
Advanc. Protochem. 
Advance. Vet. Sci. 

Agric. Engng., St Joseph, Mich. 
Agriculture, Lond. 

Agron. fF. 

Allg. Statist. Arch. 

Amer. Econ. Rev. 

Amer. Heart fF. 

Amer. F. Clin. Nutr. 
Amer. J. Clin. Path. 
Amer. F. Digestive Diseases 
Amer. F. Hum. Genet. 
Amer. }. Hyg. 

Amer. F. Med. 

Amer. F. Physiol. 

Amer. }. Publ. Health 
Amer. F. Sct. 

Amer. F. Sociol. 

Amer. F. Vet. Res. 

Amer. Nat. 

Amer. Statistician 

Analyt. Chem. 

An. Soc. Cient. Argent. 
Ann. Acad. Sci. Fenn. 
Ann. Appl. Biol. 

Ann. Fac. Econ. Com. Catania 


Ann. Fac. Econ. Com. Palermo 


Ann. Fac. Sct. Porto 

Ann. Hum. Genet. 

Ann. Inst. Poincaré 

Ann. Inst. Statist. Math., Tokyo 

Ann. Inst. Sup. Sci. Econ. 
Financ. 

Ann. Ist. Statist. Bari 

Ann. Math. Statist. 

Ann. N.Y. Acad. Sct. 

Ann. Sci. Ec. Norm. Sup., Paris 

Ann. Sci. Econ. Appl. 

Ann. Soc. Sci. Bruxelles, 1 


publication 


Acta de la Academia Cientifica, Lima 

Acta Mathematica 

Acta Mathematica Academiae Scientiarum Hungaricae 

Acta Mathematica Sinica 

Acta Pontificiae Academiae Scientiarum 

Advanced Management 

Advances in Enzymology and Related Subjects 

Advances in Protochemistry 

Advances in Veterinary Science 

Agricultural Engineering 

Agriculture (Min. of Agric.) 

Agronomy Journal : 

Allgemeines Statistisches Archiv 

American Economic Review 

American Heart Journal 

American Journal of Clinical Nutrition 

American Journal of Clinical Pathology 

American Journal of Digestive Diseases 

American Journal of Human Genetics 

American Journal of Hygiene 

American Journal of Medicine 

American Journal of Physiology 

American Journal of Public Health 

American Journal of Science 

American Journal of Sociology 

American Journal of Veterinary Research 

American Naturalist 

American Statistician 

Analytical Chemistry 

Anales de la Sociedad Cientifica Argentina 

Annales Academiae Scientiarum Fennicae 

Annals of Applied Biology 

Annali della Facolta di Economia e Commercio, 
Catania (University) 

Annali della Facolta di Economia e Commercio, 
Palermo (University) 

Annaes da Faculdade de Ciéncias do Porto 

Annals of Human Genetics (London) 

Annales de l'Institut Henri Poincaré 

Annals of the Institute of Statistical Mathematics 

Annaes do Instituto Superior de Ciéncias Economicas 
e Financieras 

Annali dell’Istituto di Statistica, Bari (University) 

Annals of Mathematical Statistics 

Annals of the New York Academy of Science 

Annales Sciéntifiques de l’Ecole Normale Supérieure 

Annales de Sciences Economique Appliquées 

Annales de la Société Scientifique de Bruxelles— 
Serie 1 
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The coverage of this Journal amounts to at least 400 sources of 


Peru 
Sweden 
Hungary 
China 


Argentina 
Finland 
Great Britain 
Italy 


Italy 


Portugal 
Great Britain 
France 

Japan 
Portugal 


Italy 

WES VAS 
TWESEAG 
France 
France 


Belgium 


Ann. Statist., Rome 


Ann. Univ. Lyon, A 


Ann. Univ. Sci. Budapest. Sect. 


Math. 
Annu. Rev. Physiol. 
Aplik. Mat. 
Appl. Sci. Res., Hague 
Appl. Statist. 
Archimede 
Arch. Biochem. Biophys. 
Arch. Math., Karlsruhe 
Ark. Mat. 
Atti (mem.) Accad. Lincei 


Atti Riun. Sci., Soc. Ital. Statist. 


Aust. F. Agric. Res. 
Aust. J. Appl. Sci. 
Aust. F. Biol. Sci. 
Aust, ¥. Phys. 
Aust. F. Sct. 

Aust. F. Statist. 


Beil. Mber. Ost. Inst. 
Wirtschaftsf. 

Bell Lab. Record 

Bell Syst. Monographs 

Bell Syst. Tech. F. 

Biometrics 

Biometrika 

Biom. Zeit. 

bla. Dtsch. Ges. Versich.-math. 


Bol Fac. Filos. Ciént., S. Paulo 


Bol. Inst. Actuar. Port. 
Bol. Soc. Mat. Mexicana 
Boll. Cent. Ric. Operat. 
Boll. Un. Mat. Ital. 

Brit. J. Nutr. 

Brit. F. Prev. Soc. Med. 
Brit. F. Sociol. 

Brit. ¥. Statist. Psychol. 
Bull. Acad. Polon. Sci. WI 
Bull. Amer. Soc. Test. Mat. 
Bull. Ass. Actuair. Suisses 
Bull. Calcutta Math. Soc. 
Bull. Calcutta Statist. Ass. 
Bull., C.S.I.R.O., Aust. 


Bull. Indian Soc. Qual. Contr. 
Bull. Indian Stand. Inst. 

Bull. Inst. Egypte 

Bull. Fap. Statist. Soc. 

Bull. Math. Statist. 


Bull. Nat. Inst. Sct. India 
Bull. Oxf. Univ. Inst. Statist. 
Bull. Res. Inst. Kerala 

Bull. Sci. Math. 

Bull. Soc. Math. Belg. 


Annali di Statistica (Istituto Centrale di Statistica, 
Rome) 

Annales de l’Université de Lyon—Section A 

Annales Universitatis Scientiarum Budapestiensis de 
Rolando Eétvés nominatae—Sectio Mathematica 

Annual Review of Physiology 

Aplikace Matematiky 

Applied Scientific Research, The Hague 

Applied Statistics 

Archimede 

Archives of Biochemistry and Biophysics 

Archiv der Mathematik 

Arkiv for Matematik 

Atti della Accademia Nazionale dei Lincei 

Atti della Riunione Scientifica, Societa Italiana di 
Statistica, Rome 

Australian Journal of Agricultural Research 

Australian Journal of Applied Science 

Australian Journal of Biological Sciences 

Australian Journal of Physics 

Australian Journal of Science 

Australian Journal of Statistics 


Beilagen zu den Monatsberichten des Osterreichischen 
Institutes fir Wirtschaftsforschung 

Bell Laboratories Record 

Bell System Monographs 

Bell System Technical Journal 

Biometrics 

Biometrika 

Biometrische Zeitschrift 

Blatter der, Deutschen Gesellschaft fiir Versicherungs- 
mathematik 

Boletim da Faculdade de Filosofia, Ciéncias e Letras, 
Sao Paulo 

Boletim do Instituto dos Actuarios Portugueses 

Boletim de la Sociedad Matematica Mexicana 

Bollettino del Centro per la Ricerca Operativa 

Bollettino dell’ Unione Matematica Italiana 

British Journal of Nutrition 

British Journal of Preventive and Social Medicine 

British Journal of Sociology 

British Journal of Statistical Psychology 

Bulletin de l’Académie des Sciences (Classe 3) 

Bulletin, American Society for Testing Materials 

Bulletin de l’Association des Actuaires Suisses 

Bulletin of the Calcutta Mathematical Society 

Bulletin, Calcutta Statistical Association 

Bulletins, Commonwealth Scientific and Industrial 
Research Organisation 

Bulletin, Indian Society of Quality Control 

Bulletin, Indian Standards Institution 

Bulletin de I’Institut d’Egypte, Cairo 

Bulletin of the Japan Statistical Society 

Bulletin of Mathematical Statistics, Research Associa- 
tion of Statistical Sciences 

Bulletin, National Institute of Sciences of India 

Bulletin of the Oxford University Institute of Statistics 


Bulletin, Central Research Institute, Kerala (University) 


Bulletin des Sciences Mathématique 
Bulletin de la Société Mathématique de Belgique, 
Gembloux 
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Italy 


France 
Hungary 


WAS 
Czechoslovakia 
Netherlands 
Great Britain 
Italy 

U.S.A. 
Germany 
Sweden 

Italy 

Italy 


Australia 
Australia 
Australia 
Australia 
Australia 
Australia 


Austria 


U.S.A. 
U.S.A. 
U.S.A. 
U.S.A. 

Great Britain 
Germany 
Germany 


Brazil 


Portugal 
Mexico 

Italy 

Italy 

Great Britain 
Great Britain 
Great Britain 
Great Britain 
Poland 
WrSsAs 
Switzerland 
India 

India 
Australia 


India 
India 
Egypt 
Japan 
Japan 


India 

Great Britain 
India 

France 
Belgium 


Bull.. Soc. Math. France 
Bull. Statist. Soc., N.S.W. 


Canad. $. Econ. Polit. Sci. 
Cancer, New York 

Cas. pést. math. 

Chem. Engng. Progr. 

Clin. Chem. 

Colect. Estud. 

Coll. Mat. 

Collog. Math. 

Comment. Math. Helvetia 
Comment. Pontif. Acad. Sci. 
Compos. Math., Groningen 
C.R. Acad. Sci., Paris 
Cornell Vet. 

Current Science 

Curso Mira Fernandes 
Czech. Math. F. 


Dacca Univ. Statist. Bull. 

Defense Sci. F. 

Dokl. Akad. Nauk. SSSR 

Dopov. Akad. Nauk. Ukrain, 
SSR 


Econ. Appl. 

Econ. Internazionale 
Econ. f. 

Econ. Leban. Arabe 
Econ. Record 
Econometrica 
Economica 

Edinb. Math. Notes 
Egypte Contemp. 
Egyptian Statist. f. 
Ekon. Tidskr. 

Emp. }. Exp. Agric. 
Endocrinology 
Engineer, Lond. 
Engineering, Lond. 
Ergebn. Naturw. Grenzgeb. 
Esta 

Estadistica 

Euclides, Groningen 
Eugen. Rev. 
Evolution 


Food Res. 
Food Tech. 


Ganita 

Gaz. Mat. 

G. Economisti 

G. Ist. Ital. Attuari 
Genetics 

Genus 

Growth 


Harvard Busin. Rev. 
Helvetica Phys. Acta 


Bulletin de la Société Mathématique de France 
Bulletin of the Statistical Society, N.S. Wales 


Canadian Journal of Economic and Political Science 


Cancer (New York) 

Casopis pro péstovani mathematiky 
Chemical Engineering Progress 
Clinical Chemistry 

Colectanea de Estudos 

Collectanea Matematica 
Colloquium Mathematicum 
Commentarii Mathematica Helvetia 


Commentationes Pontificiae Academiae Scientiarum 


Compositio Mathematica, Groningen 


Comptes Rendus de |’Académie des Sciences, Paris 


Cornell Veterinarian 

Current Science 

Curso Mira Fernandes 
Czechoslowak Mathematical Journal 


Dacca University Statistical Bulletin 
Defense Science Journal 

Doklady Akademiia Nauk, SSSR. 

Dopovedy Akademiia Nauk Ukrainskoj, SSR 


Economie Appliquée 

Economia Internazionale 

Economic Journal 

L’Economie Lebanaise & Arabe, Beirut 
Economic Record 

Econometrica 

Economica 

Edinburgh Mathematical Notes 
L’Egypte Contemporaine, Cairo 
Egyptian Statistical Journal 

Ekonomisk Tidskrifts 

Empire Journal of Experimental Agriculture 
Endocrinology 

Engineer 

Engineering 

Ergebnisse der exakten Naturwissenschaften 
Esta 

Estadistica 

Euclides, Groningen 

Eugenics Review 

Evolution 


Food Research 
Food Technology 


Ganita, Lucknow-Allahabad (Universities) 
Gazeta de Matematica 

Giornale degli Economisti e Annali di Economia 
Giornale dell’ Istituto Italiano degli Attuari 
Genetics 

Genus 

Growth 


Harvard Business Review 
Helvetica Physica Acta 
Vii 


France 
Australia 


Canada 

U.S.A. 
Czechoslovakia 
U.S.A. 

U.S.A. 

Spain 

Spain 

Poland 
Switzerland 
Italy 
Netherlands 
France 

U.S.A. 

India 

Portugal 
Czechoslovakia 


Pakistan 
India 
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France 

Italy 

Great Britain 
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Australia 
WES ze 
Great Britain 
Great Britain 
Egypt 
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Sweden 
Great Britain 
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Great Britain 
Great Britain 
Germany 
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Netherlands 
Great Britain 
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India 
Portugal 
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Italy 
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U.S:A. 
Switzerland 


Heredity 
Hum. Biol. 


IFO-Studien 
Inc. Statist. 
Indag. Math. 


Indian Coun. Agric. Res. Bull. 
Indian Coun. Agric. Res. Statist. 


Newsletter 
Indian Econ. f. 
Indian Econ. Rev. 
Indian }. Agric. Econ. 
Indian #. Phys. 


Indian F. Power & River Devel. 


Indian F. Soc. Work 
Indian Math. Soc. f. 
Industr. Engng. Chem. 
Industr. Qual. Contr. 
Industria, Milano 

Iowa Acad. Sci. F. 

Towa Exp. Sta. Publ. 
Izv. Akad. Nauk, SSSR 


Izv. Akad. Nauk, Uzbek., SSR. 


4b. Nat.-Okon. Statist. 
Fber. Dtsch. Mathver. 
¥. Agric. Food Chem. 
J. Agric. Sct. 

J. Amer. Pharm. Ass. 
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F. Amer. Statist. Ass. 

JF. Amer. Vet. Med. Ass. 
JF. Animal Sci. 

J. Appl. Physiol. 
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F. Aust. Inst. Agric. Sci. 
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¥. Biol. Chem. 
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F. Clin. Invest. 
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F. Indian Soc. Agric. Statist. 


F. Inst. Actuar. 

F. Inst. Actuar. Stud. Soc. 
JF. Madras Univ., B 

J. Mammalogy 

JF. Marine Res. 

J. Math. Pures Appl. 
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J. Nutrition 
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Human Biology 
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Indian Economic Review 

Indian Journal of Agricultural Economics 
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(Scientific Edition) 
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AMATO, V. (University of Catania) 


A practical method for the inversion of Leontief’s matrix—In Italian 


Industria (1959) 297-313 (11 refs.) 


This article demonstrates a method proposed by the author 
for obtaining the inverse of Leontief’s matrix. 

After having stated the conditions for a matrix to be 
in conformity to Leontief’s definition, the author demon- 
strates the many difficulties which increase with the order 
of the matrix. He suggests decomposing the matrix into 
factors easily invertible and so arranged as to give the 
desired solution. 

The difficulty of the calculations that are required by 
the method may be met by the use of suitable electronic 
computers, 


(P. Bandettini) 
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CHERUBINO, S. (University of Pisa) 


On Leontief’s matrices and on a problem in linear programming—TJn Italian 


Industria (1959) 156-164 (11 refs.) 


The author, starting from the properties of matrices applied 
to the input-output analysis, introduces some variants in 
the demonstration and in the notation. He also considers 
a problem in linear programming. 

A solution is outlined which gives the maximum and 
minimum of two (dual) homogeneous linear functions. Of 
these two functions, one is the total production of the n 
sectors and the other cost prices. The production and price 
functions describe two convex polyhedral cones, the equations 
of which have the same Leontief’s matrix as the coefficients 
of the inequalities conditioning the maximum and _ the 
minimum. 

This problem suggests a general method for the inversion 
of Leontief’s matrices. 


(P. Bandettini) 
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KRELLE, W. (Universitit Bonn) 


08454) 


Discrete programming: theory and practical applications—In German 


Unternehmensforschung (1958) 4, 161-175 (10 refs.) 


Discrete programming is defined as the solution of extremal 
problems with equations or inequalities as constraints under 
the additional condition of discrete values for some or all 
variables. Many economic problems take this form. ‘The 
statement of the problem and its mathematical formulation 
are given for the following examples: 


(i) planning of production with indivisible final products, 
(ii) planning of production with indivisible factors, 


(iii) planning of production with indivisible factors and 
final products, 
(iv) planning of production with growing increase of 


profits, 
(v) assignment problem, 


(vi) localising problem, 

(vii) travelling salesmen problem, 

(viii) transportation problems: on the one hand for non- 
homogeneous goods and constant capacity of trans- 
portation units; on the other hand for different units 
with limited capacity and given frequency of trans- 
portation of persons, 

(ix) allocation of machines. 


Some methods of solving discrete programming problems 
are roughly outlined. The logical structure is always the 
same: from a certain number of combinations, that number 
will be chosen which yields an optimum with respect to a 
given criterion. Because of the immeasurably great number 
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LUKE, Y. L. (Midwest Research Institute, Kansas City) 


of possible combinations, one generally has at first to try 
to reduce as much as possible the number of variables and 
combinations by making use of the peculiarities of the 
special problem. 

It is often advantageous for discrete programming to use 
only such variables which assume values of 0 or 1. If the 
original mathematical formation does not have this form, 
it can be obtained by substitution: this way was chosen by 
Markowitz and Manne. In this way the possible combina- 
tions will be reduced, on the one hand by a definition of 
the maximum and minimum number of those variables 
which must be in the solution and, on the other hand, by 
determining those variables which are certainly not included 
in the solution, as well as those which definitely are included. 

If the constraints are not very important, the ‘‘ approxim- 
ation from above”’ is applied. First a discrete solution is 
determined without taking account of the constraints. After 
that one has to examine whether the constraints are fulfilled, 
and to resort to the second best value of the extremal 
function if this is not so, until a solution is obtained for which 
the constraints are satisfied. 

If the constraints are essential, a large number of com- 
binations become irrelevant. In this case there exist two 
ways of “‘ approximation from below ”’ to find all combina- 
tions which satisfy all constraints. The value of the extremal 
function has to be computed step by step after which the 
solution may be selected. 


(H. Kregeloh) 


0.4 (11.6) 


Expansion of the confluent hypergeometric function in series of Bessel functions—In English 


Math. Tab., Wash. (1959) 13, 261-271 (13 refs.) 


The confluent hypergeometric function, defined by 
© (@)2* 
k=0 (¢)eR! , 


where (a), = I'(a+hk)/T(a), and where I'(z) is the gamma 
function, is expanded in a series of Bessel functions of 
integral order whose argument is independent of a and c. 
This expansion is useful for computational purposes since 
the parameters and variable of the confluent hypergeometric 
function appear separately. Bessel functions of integral 
order are easy to generate on a computer and tables are 
available for desk calculation. 

If c is equal to a+1 the function reduces to the in- 
complete gamma function. The author develops several 
expansions for the incomplete gamma function and by a 
further specialisation of a = 4, obtains expansions for the 
normal integral. 

The author gives several numerical examples to illustrate 
the computational efficiency of the expansions. 


o(a, C, 2) Sail (50; 2) <a 


(D. Teichroew) 
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MORRISON, D. (Space Tech. Labs., Los Angeles) 
Numerical quadrature in many dimensions—In English 
JF. Ass. Comp. Mach. (1959) 6, 219-222 (2 refs.) 


Consider the problem of evaluating J, where I is given by 
a a n n 
L= { | Jt, a+) %y) 11 w(x) IT dx, 
—a ie Dail! t=1 


and where w(x) is a positive and even weighting function. 
A numerical integration formula is developed, the essential 
step being the reduction of the problem to one of approxi- 
mating integrals of functions with fewer variables. <A 
k-dimensional trace of a function of n variables is defined 
as the value of the function obtained by setting all variables 
equal to zero except k of them, e.g., the two-dimensional 
trace P(x), x3)f is f(x:, 0, x3, 0, ..., 0). It is then proved 
that if f is a sum of functions of d variables then f may be 
written in terms of traces of these variables. The numerical 
quadrature formula is developed explicitly for the case of 
two dimensions and a fifth order formula is obtained. By 
writing the function as a sum of its traces, J is reduced, 
in this case, to a problem of numerical quadrature on a 
square or on a line segment. Both of these problems are 
solved with the aid of a three-point Gauss quadrature 
formula which corresponds to the weighting function w(x), 
there the three points are (—«, 0, «) with weights (w,, wo, @). 
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0.3 (-.-) 


The formula becomes 


—1 2 
eee . (~) (a) 
j=1 j=it+l1 pa=1 


n 2 
+A" w,[—2on+oot4o,] 2 2 P(x?) f 
=al Fos! 


+A"-*[20%n? —2w,(w 9 +3u,)n +A? Pf, 


where x{? = —a and x’) = « and Pf = f(0, 0, ..., 0). 

In the special case where w(x) is equal to 1, A = 2, we 
have « = 3/5, wy = 8/9, w, = 5/9; e.g., the points and 
weights of the Gauss-Legendre quadrature formula. In 
this case the result reduces to one already given by Hammer 
and Stroud [Technical Report No. 5, Numerical Analysis 
Laboratory, University of Wisconsin]. 


(D. Teichroew) 


0.6 (11.6) 


A method for computing eigenvectors and eigenvalues‘on an analogue computer— 


In English 


Math. Tab., Wash. (1959) 13, 194-201 (8 refs., 2 tables, 1 fig.) 


The problem of computing eigenvectors and eigenvalues of 
an arbitrary matrix has been the subject of a number of 
papers in recent years. Most of the methods suggested 
have dealt with the use of digital computers; this paper gives 
a procedure using analogue equipment. 

The technique is based on the analogue computer 
method for solving problems in linear programming described 
by Pyne [Trans. Amer. Instn. Elect. Engrs., Part 1—Com- 
munications and Electronics (1956), 139-143]. The method 
consists of formulating the problem of determining eigen- 
values as an extremum problem, as follows: let x be the 
radius vector to the point X = (x, ..., %,) in m space. 
Let x1, %2, ..., %n be functions of time. The first eigen- 
vector e! is found by letting the point X move on the unit 
sphere until it reaches a steady state corresponding to a 
maximum of the function AX.X. Then e! equals the 
steady state of X. The other eigenvectors, e”, are found 
in the same way using additional restrictions on X such 
that X.e!= X.e7 =... = X.e*'=0. 

The analogue method described has the advantage of 
yielding both eigenvectors and eigenvalues of a real sym- 
metric or complex Hermitian matrix, and do not require 
a trial and error procedure. ‘This is accomplished at the 
expense of additional computing equipment. The paper 
gives the amount of computing equipment required to 


332 


compute an eigenvector. The amount of equipment depends 
on the size of the matrix and the number of eigenvectors 
desired. ‘The eigenvalues can be obtained using the com- 
puter or can be obtained by hand. Once an n by m matrix 
problem has been programmed, this programming and the 
associated patching can be used for any other 2 by matrix 
with only minor changes. 

The author gives two examples, one a 3 by 3 matrix 
and the other a 6 by 6 matrix, and shows the difference 
between the analogue and the correct results. ‘Three place 
accuracy can be obtained for the eigenvector and if the 
eigenvalue is computed by hand six place accuracy can be 
obtained. 


(D. Teichroew) 
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RALSTON, A. (Bell Telephone Labs., Whippany, N.J.) 


A family of quadrature formulas which achieve high accuracy in composite 


rules—In English 
JF. Ass. Comp. Mach. (1959) 6, 384-394 (2 refs.) 


A class of quadrature formulas is derived which achieve 
higher accuracy in composite rules (i.e., where the interval 
of integration is broken up into a number of sub-intervals) 
than analogous Newton-Cotes or Gaussian formulas. The 
cost of this higher accuracy is the computation of one or 
two more ordinates over the whole interval of integration. 

The high accuracy is obtained by using Gaussian tech- 
niques in the interior of each sub-interval and by using 
the end-points of each sub-interval as abscissas with weights 
of equal magnitude and opposite sign. In this way when 
the sub-intervals are put together only the end-points of 
the whole interval of integration remain. It is proved that 
the abscissas are all real and interior to the sub-interval 
and that the weights corresponding to the interior abscissas 
are positive. 

Since the abscissas are not equally spaced, the method 
is not suited to tabular functions but rather to analytically 
given functions. The round-off properties of the formulas 
are discussed and are shown to be quite good. 


(A. Ralston) 
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0.36.) 


0.3 (2.0) 


The use of the central limit theorem for interpolating in tables of probability 


distribution functions—In English 


Math. Tab., Wash. (1959) 13, 213-216 (6 refs.) 


It is well known that interpolation in tables of probability 
density and distribution functions is difficult, especially if 
the values desired are in the “‘tails’’. It is possible to 
make use of the central limit theorem to increase accuracy 
of interpolation for those distributions which tend to 
normality. 

As an example, consider the problem of determining 
the value of the binomial distribution E(m, 79, po+ Ad) 
where 7, is the sample size, 7) is the number of observed 
successes and p,+Ap is the parameter. Let us assume 
that the function is not tabulated for the argument p)+Ap. 
The standard interpolating procedure consists of taking the 
tabulated values corresponding to arguments near the desired 
ones and constructing a polynomial based on k+2 given 
points. The desired value is then obtained by evaluating 
the polynomial at the desired argument. The method 
proposed in this paper is the following: 


(i) Write down the functional values corresponding to 
the arguments given in the table. 
(ii) Look up in a normal table the normal deviates 
corresponding to these probabilities. 
(iii) Use the standard interpolating technique to inter- 
polate in these normal deviates for the value of the 
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normal deviate corresponding to the desired para- 
meters; e.g., the argument of the interpolating poly- 
nomial would be p+Ap in the above example. 

(iv) Look up in the table of the normal distribution 
the value corresponding to this calculated normal 
deviate. This is the desired value. 


A second approach suggested in this paper is to use, 
instead of the normal distribution in the above method, 
the logit function 


L(x) = 1/1 -+e-**, 


This function is relatively close to the normal distribution 
for particular values of f. 

These two methods have been compared with the ordinary 
method using both two and four point interpolation, in 
which interpolation is performed for 7) and 7» as well as 
for py. The four point normal deviate procedure was far 
superior to other methods. 


(D. Teichroew) 
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SCARF, H. (Stanford University, California) 


Bayes solutions of the statistical inventory problem—ZIn English 


Ann. Math. Statist. (1959) 30, 490-508 (8 refs.) 


In the inventory problem a sequence of purchases are made 
at the beginning of a number of regularly spaced time 
periods. In addition to the purchasing cost, which may be 
a general function of the amount purchased, it is customary 
to charge a holding cost at the end of each period if excess 
stock is on hand or a shortage cost if there is a deficit of 
stock. Demands are usually assumed to be independently 
distributed and with a common known distribution. ‘T'he 
solution to the problem is a set of purchasing decisions 
which minimises total expected cost: sometimes the cost 
is discounted. 

In this paper we remove the assumption that the demand 
is known, and replace it by the assumption that the demand 
distribution contains an unknown parameter (specifically 
from the exponential family) and that the parameter has a 
known a priori Bayes distribution. 

This feature adds considerable complexity and other 
features are simplified in order to obtain a manageable 
answer. ‘The primary simplification is that the purchase 
cost is assumed to be proportional to the amount purchased. 

It is then shown that the optimal policy is defined by 
a sequence of functions x, (mean observed demand), such 
that if at the beginning of the mth period the stock level is 
less than x,, the difference is purchased and otherwise no 
purchases are made. 
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0.8 (1.8) 


The functions x, are shown to be increasing in the 
mean demand, and it is also shown that purchases will 
never be made in a period if demand during the previous 
period has been too small. Finally an asymptotic expansion 
of x,, for large n, is given. 


(He Searf) 


0.6 (11.5) 


On the spectral norms of several iterative processes—In English 


$F. Ass. Comp. Mach. (1959) 6, 495-505 (7 refs.) 


A linear iterative process of first degree for solving a set of 
simultaneous linear equations 


Au = d, 


where A is a symmetric, positive definite matrix of order n, 
and u and d are column vectors, is defined by 


u™ = H,u@-) +M,,d 


where Figs Av Tennn = 12; 22 


m 
let IK SABE Sele 
n=1 
The spectral radius of a square matrix A is equal to 
the maximum of the magnitudes of the characteristic roots 
of A. The spectral norm is the positive square root of the 
spectral radium of A’A. There exists a class of iterative 
methods in which K,, is a polynomial P,,(H) of degree m 
in a given matrix H which is independent of m. Among 
these iterative methods are the Jacobi method, Richardson’s 
method, the ‘‘ best-strategy ”’ methods of Stiefel, the Young- 
Frankel successive over-relaxation method, and the linearly- 
accelerated Gauss-Seidel method. 
This paper gives, under certain assumptions, formulas 
for the connection between the spectral radius of the Jacobi 
matrix and the spectral norms of four iterative processes. 
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The first process is the Young-Frankel successive over- 
relaxation method in which 


Wie, (Lo4,0,)” where Low = U—wk)“{(1 —w)I+wE’] 


where w = the value which minimises the spectral norm 
of Lewy. It is shown that the Euclidean norm of the error 
may actually increase in the first few iterations of the 
successive over-relaxation method. For the second process 
considered 


Ue, — (Loy ,w)” *Lo4,1 


it is shown that if the matrix A has o, ordering, a smaller 
spectral norm will be obtained if the successive over- 
relaxation method for performing a single iteration with 
w = 1 is carried out first. The third process is defined by: 


Kn = Trl{2/p(B)}Lo,,1 —D]/ Tl {2/p(B)} -1]. 


where T,,,(x) are the Tchebycheff polynomials of the first 
kind. It is shown that the Euclidean norm of error goes 
to zero as for m/T,[(2/ p(B)*)—1]. For the fourth method 


LG, = Lo1 Tmal{2/(9 B)}L04,1 —I/ T m—al{2/p(B)*} =f ]. 


it is shown that the Tchebycheff accelerated Gauss-Seidel 
method can be improved by performing a single iteration 
first with w = 1 and without any “ acceleration.” 


(D. Teichroew) 
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SMITH, R. B. (Westinghouse Elec. Corp., Pittsburgh) 0.6 (1.3) 
Two theorems on inverses of finite segments of the generalised Hilbert 
matrix—In English 
Math. Tab., Wash. (1959) 13, 41-43 (5 refs.) 


The generalised Hilbert matrix is defined to be These results are proved using the relationships involving 
1 ae the binomial coefficients and then defining a function of x 
A, = (hi), hig =—-———7,_ 1,7 = 1,2, ..., n; as a series, differentiating the function and setting x equal | 
Pacircde 1 to 1 | 


py 20) oer 


The elements of the inverse of this matrix can be given as 
a closed expression, in particular the element in the ith 
row and jth column of S, is 


— 


(D. Teichroew) 


n—1 
| ae Sa CES ACES ae ey 


The following formulas are developed for sums of rows 
and columns and sums of all elements in the inverse. 


a 


n ae n .. n—1 
= S, = ince Sos i) tee (p+tj+k)/(g=1)! aj)! 
= i 20) 


n os 
rah S? = n(p+n). 


oe a 
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STEINHAUS, H. & TRYBULA, S. (Math. Inst., Polish Academy of Sciences, Wroclaw) 0.7 (1.7) 
Measurement by successive comparison—In Polish 
Zastosowania Mat. (1959) 4, 204-212 


2 ee ee 


In this paper the authors give the solution, by the methods 
of the theory of games, of a few practical problems in the 
sphere of approximate measurements of certain quantities. 
For example, comparing successively an unknown quantity 
x€{0, 1) with known quantities t), tg, ..., tn. On the basis 
of the results of those comparisons we give the approximate 
value of x’ of x. The functions t,, tz, ..., ty and the estimate 
x’ should be chosen so as to minimise W(| x—x’ |). This 
problem may be regarded as a game of man against nature, 
in which man chooses fy, ts, -.., tn and x’, and nature chooses 
x. Man pays to nature W(| x—x' |). We prove that if 
W is an increasing function of the argument | x —x’ |, then 
the best strategy for man in the minimax sense is t; = 3; 
t. is equal to the centre of the segment <0, 4) or (3, 1), 
according to whether the point x is in the first or in the 
second of those intervals, etc., and x’ is equal to the centre 
of the segment obtained after m steps. If x is a random 
variable with a uniform distribution, then the same 
strategy minimises the expected value of W(| «—x’ |). 

A few practical consequences of those and similar results 


are given. 


(H. Steinhaus) 


338 


pnt equtecocletey oft ania Levon sen sina’ 
t(Mitsii a -gomnteb aads bose ateotyators falta wits 


: ; ‘ * 
> * jocliine. Hire norts wil acti HIT AE iy Pees th ao ott anes ms { + 
- a | rg 


ti wt} -- s if 


mvig od nao eee i: Ww eit ty a 
in wie on ticrtpala writ velusitmg at, qa 


: al XS be oon nulng A be 
1" 7 

1 ' 

4) ‘oe 2 ny 

(t- 0 [re oerey 

ewet to one rot bogpilaveh <t eniver galwelie 


, a Lay 
oT teak oe) ie aerials The We eerie be “wate Hee: 


priigia ft’ CT) 


AAUP GA ot 


hm ah AA ey 


mee 


i~@ 
Hie wd ERE eo eed i las te 


M “ 
<li> Hh) te oe <> 
Sia 
7 — 
a 
= i 
= 
vA (wate? enone 1h whit, delat teat chi ADIT 2 a ta =f 
Aaidatt, ah -~fpuealan qed sehay un ach. ae , 
FOL & (COOL) Anh nivnimentin 
7 chodwan on? vd .neitidos ochh rig wri ods 2 
a4 oct itt artroldar rey hac daeiq wot «to mune to. 
a ~Witewo cere Yo TNT ERNE otomixmn 
: 


Jtunadp owootan aw viz Wiese atcmsypnns sig 
eat oft nO) Qh ..\. oh wk ectienepy eters aie € e 
; Ebestgge sh avi ave commbegaris said We # 
siaraiieas ols at 1... el 3 anion? ofl pete, 
“HT (| “e-<s YT Olina nian nasods oh Bb 
STU? CORA a sitieg « ht bobreget od van o 
eee rg aie Lewd : ‘. ores a. vat ae) eonooels omer ho 
i tet avenq OW See wees of ayoq nol 
49 | “won, ) tena rg) elt ly suitvenlt  eeuanel 
: f= |) ah-aeeeas aun wt al cote wl 

cA! 2) 90 OL) OD Serge ot te attcres agty: Otel 
ut? in 36 2078 one al a # Seto pall meipretoge i 
: stfeso wll mn Tatype at ‘y. betejora Mersin wads ty. 

abet ¢ a 6 3) @epte Ww Telly Berbers Same: 
im ow wll Orth eilian 
=f ‘e= «1 nul homoges al? x 
Nuseot tylerite bow ae te enimppeecwe f 


(corletates Dp) 


SUZUKI, YUKIC (Inst. Statist. Math., Tokyo) 


Note on linear programming—In English 


0.8 (—.-) 


Ann. Inst. Stat. Math., Tokyo (1959) 10, 89-105 (11 figs.) 


In this paper, the author treats some computational methods 
for linear programming. He raises the question whether 
there is any method using the optimal solution on a basis 
of an initial linear programming problem for solving various 
new problems which are derived by varying the values of 
coefficients of the restriction of the initial linear programming 
problem. The method used here is essentially the same 
with the Dantzig’s simplex method. 

In section 2, the case where the value of the constraint 
vector is changed is treated. In this case, the optimal basis 
B® of the initial problem is not always the one corresponding 
to an optimal solution of the new problem and furthermore 
it may not be feasible for the new problem. When it is 
feasible, then an optimal solution of the new problem is 
only obtained by further simplex algorithm. However, in 
case it is not feasible for the new problem, the author con- 
structs a modified linear programming problem which has 
B* as a feasible basis and whose optimal solution coincides 
with the one of the new problem, where B* is constructed 
by changing signs of some vectors in B®. The author 
says that, when the change of the constraint vector is not 
so great, the modified problem may be solved with fewer 
stages. 

In section 3 the author considers the case where the 
coefficient matrix A of the linear equations or inequalities 
of the restriction is changed. He only discusses the special 
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case where a single column of A is changed. He splits the 
problem into the following two cases. Case (i): the vector 
changed is contained in B®. Case (ii): it is not contained 
in B®. For the former case, he constructs a modified 
problem which has B? as a feasible basis and whose optimal 
solution coincides with the one of the new problem. For 
the latter case also, such a modified problem is considered. 

In section 4, the change in coefficients of the objective 
function is discussed. The author states that here we have 
only to continue the algorithm. 

In section 5 the case where only one further restrictive 
linear equation is added to the initial restriction is discussed. 
In this case, a feasible basis of the new problem is easily 
constructed by using B® and here also the repetition of 
algorithm is required. In this section a lemma is given 
concerning a condition for existence of a feasible solution 
of the new problem. 

In section 6 the case is treated where only one of linear 
equations of the restriction is eliminated. "The method is 
analogous to that in the previous section. In this section, a 
lemma is given concerning a necessary condition for the 
existence of an optimal solution of the new problem. 


(T. Uematu) 


0.2 (11.5) 


Polynomial curve fitting on electronic computers—In English 


Nat. Conv. Trans., Amer. Soc. Qual. Contr. (1959), 565-573 (3 refs.) 


Polynomial curve fitting is simplified by the use of available 
tables of orthogonal polynomials when the n values of the 
independent variables are equally spaced. When the values 
of the independent variable are not equi-spaced and n is 
large, construction of orthogonal polynomials is expensive 
and difficult even with large computers. 

Computing difficulties arise mainly from the condition 
of the square matrix A of the normal equations Ab = g. 
The problem discussed is that of finding a method of 
transforming the independent variables such that a reasonable 
inverse, C, of A is obtained. It is necessary to raise the 
question of deciding when a matrix C is the inverse of a 
matrix A. Mathematically, the answer is trivial, CA = I. 
The answer is not trivial when C has been computed and 
the product CA = D has been calculated. The elements 
d;; are found to fall within 1+£;; on the diagonal and 
O-E;,; off the diagonal, where E;; is a small positive number. 
If C, is an approximate inverse of A, then b) = Cog and 
Ab, = go. The two vectors with elements go;—gi OF 
(go: —&s)/gi, where g; #0, have been suggested as criteria 
for deciding when sufficient accuracy has been achieved. 
The criterion might be the specification of an upper bound 
for any element of go:—g: or (o:—g)/gi- No method is 
known for evaluating C, element by element. 

A transformation is needed which will yield a matrix 
which can be inverted, which yields equal precision in the 
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estimates of all the coefficients, and which is nearly an 
orthogonal transformation. ‘The transformation defined in 
terms of the Tchebysheff polynomials meets these criteria. 
These polynomials are 


T(x) = cosj arc cos x —1 1 
or T (x) = cos 7 are cos (2x —1) Oa 
Other properties of these polynomials are 

Max T,(x) = | Min T(x) | = 1 

within the interval (—1, 1); these boundaries are attained 
for all values of 7. The Tchebysheff polynomials have a 
higher average error in approximating a polynomial than 
the Legendre polynomials but have a lower maximum error. 

Since the power series expression for 7;(«) has alternate 
zero coefficients when —1 XxX, it is better to apply the 
initial transformation to the independent variable x’ as 

x = (2x —% max =H win)| (max Xan 

Remarkable gains in accuracy are afforded by the use 

of this Tchebysheff transformation. 


(D. W. Gaylor) 
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WILKINSON, J. H. (National Physical Lab., Teddington) 


0.6 (11.5) 


Stability of the reduction of a matrix to almost triangular and triangular forms 
by elementary similarity transformations—In English 
JF. Ass. Comp. Mach. (1959) 6, 336-359 (5 refs., 6 tables) 


In the recent paper Givens [¥. Soc. Industr. Appl. Math. 
(1958) 6, 26-50] has described a method of reducing a 
general matrix to almost triangular form by a succession 
of simple unitary transformations. In the same paper he 
described a second sequence of unitary transformations for 
reducing the almost triangular matrix to a triangular matrix. 
This second transformation is performed in n stages and 
at each stage one eigenvalue and one eigenvector of the 
current transformed matrix (and hence of the original 
matrix) are determined, the vector being obtained as a by- 
product of the transformation itself. 

In this paper the author describes two alternative methods 
for reducing a general matrix to almost triangular form by 
means of similarity transformations which are derived as 
the product of a number of simple elementary similarity 
transformations. Such transformations have been described 
elsewhere, but here the emphasis is on numerical stability 
and details are given of a method of organising the com- 
putation in fixed-point arithmetic so that rounding errors 
are kept to a minimum. 

A similar alternative is described for the further reduc- 
tion to triangular form and it is shown that this second 
transformation suffers from numerical instability which will 
sometimes render it valueless for finding the eigenvector. 
The same criticism applies with equal force to the process 
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described by Givens for the reduction from almost triangular 
to triangular form. It is shown that the instability is of a 
similar nature to that analysed in a recent paper by Wilkinson 
on the calculation of the eigenvectors of triple diagonal 
forms. The method which is suggested for overcoming this 
instability has been programmed for DEUCE and has given 
good results in practice. 

In general the methods of this paper involve considerably 
less computation and programming than those described 
by Givens; so much so, that on DEUCE they can be 
performed using double precision in about the same time 
as can the Givens transformations using single precision. 
Experience with the two methods has suggested that they 
give comparable accuracy when both are performed using 
single precision arithmetic. In the final section a numerical 
example is given showing results which are typical of those 
which have been obtained in general. 


(J. H. Wilkinson) 
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AGNEW, R. P. (Cornell University, Ithaca, N.Y.) 


1.5 (2.4) 


Asymptotic expansions in global central limit theorétns—In English 


Ann. Math. Statist. (1959) 30, 721-737 (7 refs.) 


This paper is concerned with constants C,,(p) defined by 


cule) = J 


where ¢(x) is the Gaussian distribution function and F,,(x) 
is the distribution function of the sum (£,+é,+...+é,)/n}, 
each €; being a random variable having a distribution 
function F(x). The case p = 2 is of primary interest. The 
author cites references wherein it is shown that the C,(p) 
exist if p>4 and have limit zero in m. For F(x) as the 
symmetric binomial distribution function and F(x) as a 
uniform distribution function, first order asymptotic expres- 
sions for the C,,(2) are given. 

The primary purpose of this paper is to establish 
asymptotic expansions for the C,,(2) of the form 


| F(x) —4(x) |Pdx 


id 
C,(2) = 5 Dn*+0(n-*), 
2— 


The author carefully investigates the conditions under 
which such expansions are valid. He has given a rather 
detailed description of the work which was carried out to 
obtain these expansions. This work also resulted in formulae 
for the D; in terms of the moments of F(x); these are shown. 
A portion of the calculations for D; is separable into two 
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AKAIKE, H. (Institute of Statistical Mathematics, Tokyo) 


cases: different approaches are needed depending on 
whether or not the magnitude of the characteristic function 
of F(x) is or is not periodic. 

As a special example the author considers the binomial 
distribution function. For the general, non-symmetric 
case only D, is computed, with an indication that further 
straightforward calculations of higher D,; are possible but 
tedious. ‘The symmetric case is considered in rather greater 
detail, the first ten terms of the sequence {C,(2)} being 
explicitly given. In addition, an approximate formula for 
C,(2), apparently good for n=5 is obtained. The procedure 
for calculating the C,(2) exactly is derived and presented 
in a form which can be used for numerical work. The 
exact method is, however, much more laborious than the 
approximate one. 


(D. H. Shaffer) 


Abstractor’s Note: the first term on the right of expres- 
sion (7.61) should read 0:09403 15973 n-1. 


(D. H. Shaffer) 


1.5 (2.2) 


On a successive transformation of probability distribution and its application 
to the analysis of the optimum gradient method—In English 
Ann. Inst. Statist. Math., Tokyo (1959) 11, 1-16 (8 refs., 4 figs.) 


In this paper is defined a kind of transformation of probability 
distributions over a set of real finite numbers, say n. The 
transformation is such that, when we apply it to a distribu- 
tion, the probability of each point is multiplied by a factor 
proportional to the square of the distance of the point to 
the mean of the distribution to give a new probability 
distribution. It is shown that by such transformation the 
variance of the distribution never decreases. From this 
fact it can be seen that if we consider the probability 
distributions as points in an n-dimensional Euclidian space, 
the limit points of the set of distributions obtained by 
successive applications of the transformation to some fixed 
initial distribution are restricted to those which have their 
total probability attached only to the two extreme points— 
the largest and the smallest. Further it is shown that the 
number of limit points is not more than two and if there 
are two limit points they are the transforms of each other. 
Thus, the sequence usually is “‘ oscillatorily convergent.” 
This result offers the theoretical foundation for an 
acceleration procedure of the optimum gradient method 
proposed by Forsythe and Motzkin in their paper ‘“ Asymp- 
totic properties of the optimum gradient method ” [Bull. 
Amer. Math. Soc. (1951) 57, 183]. A min-max proof for 
the least upper bound of the convergence rate which was 
first obtained by Kantrovitch is given and further we can 
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see that if the matrix is ill-conditioned, then without 
acceleration procedure the convergence rate tends near to 
its worst possible value as the approximation proceeds. 
Some numerical examples are given by using the same 
data as those treated by Forsythe in ‘‘ Punched-card experi- 
ments with accelerated gradient methods for linear equations”’ 
[Nat. Bur. Stand., Applied Mathematics Series No. 39 (1954), 
55-69] and good agreement is noted between these examples 
and the theoretical observations. 


(H. Akaike) 
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BARTON, D. E. & DAVID, FLORENCE N. 
Contagious occupancy—lIn English 


(University College, London) 


1.3 (4.6) 


J. R. Statist. Soc. B (1959) 21, 120-133 (23 refs., 4 tables, 3 figs.) 


The problem of the distribution of elements into boxes 
(cells, groups, categories, compartments, etc.) is one of 
some antiquity, dating back to De Moivre. The authors 
offer a generalised approach to the problem by which 
known results to the problem can be attained and offer also 
a new contagious-type distribution. Assume N boxes and 
n balls dropped randomly. Let N Poisson variables 
t1, t2, ..., ty characterise the numbers in each of the N 
boxes. From the joint probability law of the ¢’s, with n 
fixed the Vth factorial moment of the distribution of k, 
the number of boxes which contain no ball can be deduced 
and hence the probability law of k. This distribution is 
that found by De Moivre. If instead of a Poisson variable 
a positive binomial variable is postulated, this is equivalent 
.to putting an upper bound to the capacity of the box. The 
distribution of k under this set-up is that as previously 
deduced by Romanovsky. In assuming, as a third possibility, 
that the distribution within a cell is that of a negative 
binomial, the authors are introducing the idea of contagion 
as first proposed by Liiders. The distribution of the number 
of empty boxes is new. Finally, the p of the positive binomial 
is allowed to vary uniformly o<p<i1 and the resulting 
distribution of the number of empty boxes is found to be 
that derived by Freund and Posner. 

It is shown that the effect of assuming the positive (or 
negative) binomial for the individual cell distribution is 
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BARTON, D. E. & DAVID, FLORENCE N. (University College, London) 


The dispersion of a number of species—In English 


such as to make the “zero” distribution very different 


from the classical Poisson result. Nevertheless, provided 
the ratio n/N is fixed all the zero distributions discussed 
tend rapidly to that of the normal with increasing N. Using 
the normal approximations of the classical distribution of 
zeros for the distribution of the criterion under the null 
hypothesis, and the contagion distribution for the distribu- 
tion of the criterion under the alternative hypothesis, the 
appropriate number of boxes (relative to the number of 
balls) is discussed. All other things being equal, it is 
decided that the number of boxes should be as large as 
possible. 

‘The number of zeros when the individual cell distribution 
is a positive binomial is suggested as a criterion for analysing 
the randomness of points on a lattice. If the alternative 
hypothesis is that of “ fluctuation ”’ occupancy it is suggested 
that the lattice should be so divided as to make the number 
of boxes equal the number of balls. 

Finally, the zero criterion is compared with the index 
of dispersion. 


(Florence N. David) 


1.3 (5.6) 


J. R. Statist. Soc. B (1959) 21, 190-194 (1 ref., 1 table) 


The problem of the distribution of various species of plants 
in a field (divided into equal areas) is discussed in terms 
of balls of various colours distributed into N boxes. 

It is shown that the number of boxes containing balls 
of both of two colours is distributed hypergeometrically 
(conditional on given numbers of boxes with at least one 
of a given colour) when the different colours are distributed 
independently. This is generalised for k colours. Four 
models previously treated by the authors [7. R. Statist. Soc. 
B (1959) 21, 120-133] are considered as generating the 
occupancy distribution for a given colour: (a) random, 
(b) not more than d balls per box, (c) contagious occupancy 
(balls tending to fall together) and (d) with a varying degree 
of density of balls from box to box. The probability dis- 
tribution function of the number of boxes with both colours 
is given under (a). The number of different colours 
represented in a box is then considered, and its mean and 
variance given for models (a), (6) and (c). It is shown to be 
approximately a binomial variable (exactly so when the 
overall numbers of each colour are the same), and a numerical 
example shows graduation by the binomial probability 
distribution function to be satisfactory. 

A generalised (k-colour) form of model (b) is discussed 
where the total of all colours in a box is not more than m. 
If the total of all balls in a given box is less than m by mp, 
then it may be regarded as containing 1 blanks and 
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Ny, ...Mn Of the different colours. The results may be 
arranged as an NX(k+1) contingency table. ‘The usual 
joint distribution of frequencies is shown to hold so that 
the standard y? test for independence of the different colours 
may be used. As m becomes large this is shown to be 
equivalent to the sum of the separate Poisson indices of 
dispersion for each colour. The implication is that this 
is the appropriate test for independent distribution of 
colours under model (a). 


(D. E. Barton) 


Editorial Note: see abstract No. 345, 1.3. 
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BHARUCHA-REID, A. T. (University of Oregon, Eugene) 1.5 (1.6) 
On the convergence of a sequence of generalised random variables in an 
Orlicz-space—In German 
Bull. Acad. Polon. Sci. U1 (1959) 7, 425-427 


The paper contains three theorems which give the necessary 
and sufficient conditions for the convergence of a sequence 
of generalised random variables in a separable Orlicz-space 
to a generalised random variable in the three following 
senses: 


(i) weakly sequential almost surely, 
(ii) weakly almost surely, 
(iii) strongly almost surely. 


(L. Kubik) 
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BHARUCHA-REID, A. T. (University of Oregon, Eugene) 1.0 (10.0) 
On random operator equations in Banach spaces—In English 


Bull. Acad. Polon. Sci. U1 (1959) 7, 561-564 


In this note the following random operator equation is 
considered: 
(T(w) —ADx(u, ») = yu, «) 


where x(u, w) and y(u, w) are generalised random variables 
with values in a Banach space; T(w) is a random operator 
and I is the identity operator. This equation arises in the 
study of certain linear functional equations in probabilistic 
functional analysis, especially in the theory of stochastic 
boundary value problems. 

The author studies the existence and measurability of 
the resolvent operator associated with a random operator 


T(w). 


(L. Kubik) 
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BRAUMANN, P. (Faculty of Sciences, Lisbon) 1.1 (--) 
Symmetric infinitely devisible probability law—In German ) 
Rev. Fac. Ciencias Lisboa, A (1959) 7, 255-262 


A law is called indefinitely divisible if its characteristic 
function is, for every n, the characteristic function of the 
sum of n identically distributed random variables. 

For the characteristic functions of the indefinitely 
divisible random variables we can give some representations 
—of Levy-Khintchine, Levy and Kolmogoroff. 

The author defines the symmetry of an indefinitely 
divisible law and obtains for each representation the necessary 
and sufficient conditions for the symmetry of an indefinitely 
divisible law. 


(J. T. de Oliveira) 
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CIUCU, G. (Bucharest University) 1.5 (10.1) 
An exceptional case for the limit distribution function of sums of random variables of a 
stationary continuous Markoff process with a finite set of states—In Rumanian 


Gaz. mat. si fiz. A (1958) 3, 141-142 (2 refs.) 


‘The asymptotic behaviour is investigated of the distribution 
function of sums of random variables forming a stationary 
continuous Markoff process with a finite set of states. ‘The 
functional equations verified by the characteristic functions 
of finite sums are given, the limit characteristic function 
being determined as the limit of these characteristic functions 
corresponding to finite sums. Moreover, the associated 
system of equations with partial derivations is deduced. By 
making use of certain boundary conditions and of certain 
results given previously by Onicescu [Analele Univ. C. I. 
Parhon—Bucuresti, Seria St Naturii (1956) 12, 9-12]. The 
limit distribution function of the normally distributed sums 
is obtained as a sum of a Gaussian distribution function 
and of an error function of an oscillating type. 


(G. Simboan) 
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DANTZIG, the late D. VAN. 


(Mathematical Centre, Amsterdam) 


1.8 (10.4) 


On some questions in mathematical decision theory—In French 
Report SP 69—Statist. Dept., Math. Centre, Amsterdam (1959) 22 pp. (27 refs.) 


Two applications and a critical survey of mathematical 
decision theory are given. The first example concerns the 
optimal height of the dykes of the Netherlands and the 
second a new approach by De Leve to solve a large class 
of generalised inventory problems. 

The author criticises the notions of subjective probability 
and utility. The following example is given to show that 
maximising expected utility can be a very ineffective way 
of optomising. 

In a game, the player gains 2 with probability g = 1—p, 
and loses q/p with probability p, both 2 and gq/p being 
expressed in differences of utilities. The expected value 
of the gain equals g. Suppose the player can choose p 
arbitrarily under the condition that the amount to be paid, 
if he loses, will have to be deposited before playing. So 
F being his capital, maximising expected utility entails 
maximising gq under the condition g/p<F. The optimal 
value of p equals 1/(F-+1) and losing means ruin. If this 
game is played a large number of times, it can be shown 
that ruin will certainly occur in a finite number of plays. 
Moreover, a come-back is impossible. If expected utility 
is not maximised, a lot of strategies are available, which 
never imply ruin and give a reasonable chance of gain. 
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It is pointed out that the use of Von Neumann and 
Morgenstern’s theory of coalitions does not seem to be 
very fruitful for solving problems in modern politics because 
too many unrealistic assumptions have to be made. 

Notwithstanding all criticisms, the author regrets the 
knowledge and the experience of mathematicians not being 
used more directly when really important decisions have to 
be taken. 


(J. Kriens) 


1.5 (-.-) 


On necessary and sufficient conditions for the validity of the strong law of 
large numbers expressed in terms of moments—In English 


Bull. Acad. Polon. Sci., 111 (1959) 7, 221-225 


The following theorems are proved: 

Theorem 1. "There do not exist for sequences of in- 
dependent, symmetric random variables satisfying relations 
| X;|<z @ = 1, 2, ...) necessary and sufficient conditions 
for the validity of the strong law of large numbers expressed 
in terms of the variances oF of X;. 

Theorem 2. Let r be an arbitrary given natural number. 
There do not exist for arbitrary sequences {X;,} (¢ = 1, 2, ...) 
of independent random variables necessary and sufficient 
conditions for the validity of the strong law of large numbers 
expressed in terms of the moments my, ..., Mméz, Where 


ny = EX}, 


(L. Kubik) 
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FRECHET, M. (University of Paris) 


1.6 (-.-) 


On the practical importance of “‘ almost certain convergence ’’—In French 


Rev. Int. Statist. Inst. (1958) 26, 26-28 (2 refs.) 


It has been asserted that of the two notions “ almost certain 
convergence ”’ and “ convergence in probability’, only the 
second one is of interest in applied statistics. The author 
raises some objections against this opinion: since almost 
certain convergence implies convergence in probability, 
the first notion cannot be less useful than the second one. 
Moreover, it is shown that almost certain convergence 
provides applied statisticians with more complete informa- 
tion than convergence in probability. 


(M. Fréchet) 
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GILBERT, E. N. (Bell Telephone Labs., Murray Hill, N. J.) 


Random graphs—In English 


ES =) 


Ann. Math. Statist. (1959) 30, 1141-1144 (4 refs., 1 table) 


The following two paragraphs are a direct quotation from 
the introduction to the paper being abstracted :— 

** Let N points, numbered 1, 2, ..., N, be given. There 
are N(N—1)/2 lines which can be drawn joining pairs of 
these points. Choosing a subset of these lines to draw, 
one obtains a graph; there are 24("—1/2 possible graphs in 
total. Pick one of these graphs by the following random 
process. For all pairs of points make random choices, 
independent of each other, whether or not to join the 
points of the pair by a line. Let the common probability 
of joining be p. Equivalently, one may erase lines, with 
common probability g = 1—p from the complete graph. 

“In the random graph so constructed, one says that 
point 7 is connected to point j if some of the lines of the 
graph form a path from 7 to j. If z is connected to 7 for 
every pair 7, j, then the graph is said to be connected. The 
probability Py that the graph is connected, and also the 
probability Ry that two specific points . . . are connected, 
will both be found.” 

A generating series for Py is derived on the basis of a 
result stated in a previous paper. From this an explicit 
representation of Py is derived, as well as a recurrence 
relation 


N-1 noe 
1—Py = 2 a rater», P, = 1, 
kel 
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which is easier to use for large N than the explicit formula. 
Similarly, the recurrence relation for Ry takes the form 


N-1 yon 
try 8 (XH?) ter» 
(Gan = 
A small table of values of Py and Ry for several values of q 
is presented. 
For a sufficiently large value of N so that the above 

recurrence relations are unwieldly, it is shown that 

Py = 1—Nq¥4+0(N%qN/2) 

Ry = 1—2qN-1+0(N@?4/?) 


(R. M. Durstine) 
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HARRIS, B. (Stanford University, California) 


1.2 (2.4) 


Determining bounds on integrals with applications to cataloging problems—IJn English 
Ann. Math. Statist. (1959) 30, 521-548 (8 refs., 9 tables) 


We assume that a random sample of size N has been drawn 
from a multinomial population with an unknown and 
perhaps countably infinite number of classes. The experi- 
menter wishes to predict d(x), the number of classes that 
will be observed in a second sample of size aN, a>1, (or 
when the sample size is increased by («—1)N additional 
observations); and C(«), the coverage of a second sample 
(or augmented sample), where C(«) = 2p,, the sum is to 
be taken over those classes for which at least one represent- 
ative has been observed in the sample. 
It is shown that 


E{d(«)}~E(d) +E(m,)E*{1/x —1/x exp («—1)x} 
and 


E{C(«#)}~1 —E(n,)/N —{E(m,)|N}E*(1 [3 —1 x exp(a—1)x} 


where d is the number of classes observed, 7, is the number 
of classes occurring once in the sample, and E* denotes 
an expected value taken with respect to a distribution 
function unknown to the experimenter since it depends 
on all the unknown parameters {p;}. It can, however, be 
shown that the moments of this distribution function are 
approximately given by certain functions of the expected 
values of n,, the number of cells occurring r times in the 
sample. Hence, if we replace the expected values of n, 
by the observed values, we have estimates of the moments 
of this distribution. For sufficiently large N, we may 
suppose for fixed k, that the first k moments are known. 
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IHM, P. (Botanic Institute, Freiburg, Germany) 


Thus, if in E{d,(«)} and E{C(«)} we replace the expected 
values of d and n, by their observed values, then, we need 
only determine the remaining expected values. 

A reasonable procedure is to determine upper and lower 
predictors of d(x) and C(«) by determining the suprema 
and infima of expected values subject to moment constraints. 

The author proceeds to show that if g(x) is a continuous, 
bounded, and monotonic function on [0, b]; and its first 
k derivatives exist and are monotonic in (0, b), and g(x) is 
not linearly dependent on 1, x, x?, ..., x*, then we can easily 
determine the supremum and infimum of the expected 
value of g(x), where the extremisation is over all distribution 
functions on [0, 6], whose first k moments are specified. 
It is shown that the two extremising distribution functions 
are uniquely determined, one being continuous at b, and 
one has a saltus at b. The extremising distribution functions 
are characterised and the case b+co is also considered. 
The explicit solutions for k = 0, 1, 2, 3 are computed and 
applied to the specific problem at hand. 

The results in the paper are compared with formulations 
of the same problem by Fisher et al., Good, and Goodman, 
and several numerical examples are given. 

Several related results are also obtained and low order 
moments of d,n,,and C are computed. The large sample 
exponential approximations for E(d), E(C), and E(n,) are 
developed in appendices. 


(B. Harris) 


1.0 (0.3) 


Numerical evaluation of certain multivariate normal integrals—In English 


Sankhya (1959) 21, 363-366 (6 refs., 1 fig.) 


A method of integrating the multivariate normal density 
over rectangular parallelopipeds is presented. The problem 
is reduced to the evaluation of a single integral. The 
integrand is a product of a number of integrals of the uni- 
variate normal density over one dimensional intervals and 
a normal density. Such a reduction is possible whenever 
the covariance matrix A has the form A = D+c~*ii’, 7 being 
the unit vector (1, 1, ..., 1), Da diagonal matrix with positive 
elements in the diagonal cells. The final integration has to 
be done by numerical quadrature. The author suggests 
that by a certain transformation of the variable of integration, 
results of sufficient accuracy for all ordinary purposes can 
be obtained with the help of a planimeter. 


(S. John) 
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ISH, K. (Inst. Statist. Math., Tokyo) 


On a method for generalisations of Tchebycheft’s inequality—In English 


152) 


Ann. Inst. Statist. Math., Tokyo (1959) 10, 65-88 (11 refs., 7 figs.) 


Numerous authors have contributed to the generalisation 
of T’chebycheff’s classical inequality. In this paper, however, 
a sufficiently general new theory is developed upon a unified 
method of obtaining probability inequalities. 

Suppose that the first 2x moments py, ..., fuon Of a one- 
dimensional random variable X are given, and let E be 
any closed (or open) set without any such restrictions as 
“semi-infinite” or “‘ symmetrical about the origin”. The 
best possible upper and lower bounds of P,{X € E} are given 
in terms of j4, ..., an. For example, under certain conditions 
the upper bound is the supremum of linear forms of yy) = 1, 
Hi» +++) Ban With the same coefficients as the polynomials 
which dominate the indicator yx (x) of the set E. 

When E£ is a closed set, the upper extremal distribution 
has the following properties: either (i) the extremal distri- 
bution has its first 27 moments equal to 4, ..., en and its 
spectrum consists of at least n+1 points and at most 3n 
points, or (ii) the extremal distribution has its first 2n—1 
moments equal to 4, ..., 2,1 and the 2th moment less 
than. }42,, and the spectrum consists of at least m points and 
at most 3z—3 points. When £ is an open set, the statement 
about the lower extremal distribution is true. The results 
are fairly general and cover the classical results of 
Tchebycheff-Markoff-Stieltjes and that of Wald. Proofs 
of the statements are essentially reduced to the extension 
problem of positive linear functionals on Banach spaces. 
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LAHA, R. G. & LUKACS, E. (The Catholic Univ. of America, Washington, D.C.) 


In the Jast section the author considers three typical 
examples: a discrete set, an interval symmetrical about the 
mean, and two-dimensional cases. 

In Example 1, the necessary and sufficient condition 
that a discrete distribution on the point set {&, ..., fon} 
has the moments py, ..., Men iS given in terms of the two 
sets of values. 

In Example 2, all possible cases of upper bounds are 
shown with the character of the extreme distributions. 

In Example 3, two-dimensional distributions with the 
given means, variances and covariance are considered. 
Regions, on which the probability measure is evaluated, 
are restricted to two-dimensional open intervals with the 
length proportional to the standard deviation of the corres- 
ponding axis. The mean point of the distribution is assumed 
to be on one of the two diagonals of the regions. The 
sharp lower bounds of the probability are given in three 
cases according to the position of the mean point. 


(M. Sibuya) 


1.2 (2.0) 


On distributions whose moments are majorated by moments of a known distribution—IJn English 


J. Aust. Math. Soc. (1959) 1, 113-115 (2 refs.) 


Let a), o(?) denote the kth moments about x = 0, and 
fit), f2(t) the characteristic functions, of two distribution 
functions F(x), F(x). Suppose that f,(t) is an analytic 
function of t, and that for any k, | «4 | < | «?) |. Using 
well-known properties of analytic functions, the authors 
prove that the circle of convergence of the Maclaurin series 
for f; contains the circle of convergence for f,, and that in 
the case of f, being an entire function f, is also an entire 
function with order not exceeding the order of f,. The 
method of proof is simpler than that used by Milicer- 
Gruzewska (1949) for a particular case—essentially F(x) 
being normal with zero mean. An example is given showing 
that the strip of convergence (parallel to the imaginary axis 
in the complex t-plane) of the integral defining f,(t) may not 
contain that of f,(t). 


(E. W. Bowen) 
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ONICESCU, O. (Bucharest University) 
Notes on Boolean algebras—In French 
Rev. Math. Pures Appl. (1959) 4, 345-350 (6 refs.) 


This paper is divided into three parts. In the first part 
the structure of a probabilised Boolean algebra A is in- 
vestigated. The notions of atom and quasi-atom are 
introduced by making use of indecomposability respectively 
of the elements of the Boolean algebra and of their corres- 
ponding measures. Further, considering the Boolean 
algebras of atoms and quasi-atoms and the so-called con- 
tinuous kernel, a canonical decomposition of A is given, 
as for the corresponding measure. Moreover, it is shown 
that the continuous kernel is characterised by the Darboux 
property. 

In the second part it is proved that any part of A has 
a maximal element. 

Starting from a denumerable additive Boolean algebra 
A, in the last part the author suggests a method for con- 
structing a punctual space and a Borelian field isomorphic 


to A. 


(R. Theodorescu) 
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PRATT, J. W. (Harvard University) 
On a general concept of “‘ in probability ”—Jn English 
Ann. Math. Statist. (1959) 30, 549-556 (6 refs.) 


Mann and Wold have given a general theory and notation 
for stochastic limits and limit distributions [4nun. Math. 
Statist. (1943) 14, 217-226]. In so doing they gave a 
probability meaning to the O and o notation that has proved 
so useful in analysis. In the present paper the definition 
of “in probability’? is broadened and clarification is 
attempted by separating the stochastic and limiting aspects 
of the problem. 

Thus, given sequences of random variable {X,} and 
numbers {r,}, the usual definition of 0, is as follows: 
X, = 0,(rn) if for every positive «, 7 there exists N such 
that for n>N P,[xn/7n Sn] 21—<«. In words the probability 
that x,,/7,, is small tends to unity as m increases. 

The two aspects of the problem are separated if the 
definition reads X,, = 0,(r,) if for every positive « there 
exists c,, with P, | Xn | = erand c, —"0(%p): 

This form of the definition is generalised in terms of 
sets in the product space of the random variables, and it 
is shown that usual theorems on interchangeability of 
functional operations and stochastic limits are true and in 
fact are almost trivial. Some examples are given and some 
remarks made about possible further generalisations. 


(D. G. Chapman) 
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1.0 (0.0) 


1.6 (1.5) 
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SANKARANARAYANAN, G. (Annamalai University, Madras) 1.5 (-.-) 


A note on the equi-distribution of sums of independent random variables—In English 
Indian Math. Soc. F. (1958) 22, 93-98 (1 ref.) 


If Xj, X2, ... is a sequence of independent random variables 
with a common distribution function and S,, = X it... +x, 
then Robbins has proved that the class H of functions h(x), 
such that with probability one 


ee ae ee 

lim — J h(S;) = M(h) 

no MN j=1 
where M(h) is a suitably defined mean, contains all uniformly 
almost periodic functions. The author proves the same 
result when S; is defined as 

Sey Pune Seer Pe ar +Xp,.+..p, 

where ~i<P.<... is a sequence of integers increasing to 


infinity. 
(K. R. Parthasarathy) 
| 
| 
361 
THEODORESCU, R. (Institute of Mathematics, Bucharest) 1-156) 


On an abstract integral in probability—In English 
Rev. Math. Pures Appl. (1958) 3, 277-281 (5 refs.) 


Let us consider a random vector _X, i.e. an application of an 
* interval U into the set of all random elements (in the sense 
of Mourier), defined on a fixed probability space and with 
values in a given Banach space. A random vector K is 
said to be weakly integrable in probability if every linear 
functional of X is integrable in probability on all subintervals 
U, of U—and if there is a random element X corresponding 
to each subinterval U, of U such that the integral in 
probability on U, of every linear functional of X is equal to 
the random variable formed by applying this linear functional 
tox: By definition, the random element X corresponding 
to U, is the weak integral in probability on Uy of X. This 
integral is uniquely defined almost everywhere with respect 
to the probability measure occurring in the basic probability 
space. Several properties of this integral are then given, 
in analogy to those of the Pettis- and the Fan-integral. 


(G. Simboan) 
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TRYBULA, S. (Math. Inst., Polish Academy of Sciences, Wroclaw) 
On lower bounds for probability moments—In English 
Bull. Acad. Polon. Sci. III (1959) 7, 267-269 


Suppose that the random variable X has an absolutely 
continuous distribution with density function f(x). The 


(eo) 
nth probability moment 7, is defined by 7, = i f "(x)dx. 


In this note the author gives the greatest lower bound of 
parameter 7, for random variables with finite kth central 
absolute moment. 


(L. Kubik) 
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ADAM, A. (Institiit ftir Statistik, Universitat Wien) 
Measures and indices of distributions—In German 
Metrika (1959) 2, 239-244 (4 refs.) 


The paper deals with a measure of distribution and indicates 

- some of its properties. Let x, y, z be some attribute vectors; 
p(x, y, 2) the corresponding density function. ‘The measure 
of distribution is defined by 


P(z | y, x) = exp {—E In pz | y, x) 20. 


The connection with the entropy of information: theory is 
obvious. ‘The author shows that the properties of P are 
very similar to corresponding properties of probabilities. 
In the case of a normally distributed one-dimensional 
variable x, P(x) = V2mec(x); an analogous formula holds 
in the case of a multivariate normal distribution. For all 
continuous distributions, the inequality 0 <P(«) < V 27eo(x) 
is mentioned. According to the definition of P, the variance 
has the meaning of a set of informations. 

The author then defines an index of distributions R by 
means of P: 

R(eny | x) = Plz |v, x)/P(e | x). 


He interprets R as the reduction of variance for the vectors y 
and 2 respectively if, and provided the influence of x is 
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AGGARWAL, O. P. & GUTTMAN, I. (Purdue Univ., Indiana and McGill Univ., Quebec) 


2.1 (10.5) 


eliminated, some dependence exists between y and x. For 
continuous distribution functions the result 


R(xny) SR(Axny) 


holds for non-singular linear transformations A. Some other 
properties of R are mentioned in the paper, especially an 
analogy to Bayes’ rule. R is considered as a central measure 
of information. 


(J. Roppert) 


2.8 (11.1) 


Tables of the cumulative distribution functions of samples from symmetrically 


truncated normal distributions—In English 


Ann. Inst. Statist. Math., Tokyo (1959) 11, 55-68 (1 ref., 5 tables) 


Tables of the functions ®,(t) which are the cumulative 
distribution functions of the sample mean minus one-half 
unit, are given to five significant figures for samples of 
size 2 from distributions obtained by truncating at --a the 
normal distribution with zero-mean and unit variance. 
Tables are given for 7 = 1, 2, 3, 4 for values of a = 1(0°5)3 
with ¢ advancing in steps of 0-01 until the value of a. 

The density functions of these distributions were given, 
and these tables used, in the paper “‘ Truncation and tests 
of hypothesis,” [Ann. Math. Statist. (1959) 30, 230-238] 
by the same authors, in the course of examining the loss 
of power when using tests based on the assumption that 
the variable has a complete normal distribution when, in 
fact, the distribution is “‘ truncated ”’. 


(H. Akaike) 


Editorial Note: see also abstract No. 187, 2.8. 
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AITCHISON, T. 
A ptistical theory of remnants—In Paieh 


(ene of Mathematics, University of Glasgow) 


2.8 (2.6) 


J. R. Statist. Soc. B (1959) 21, 158-168 (5 refs., 3 tables) 


The probability distribution function g(y) of the fractional 
part y = x—[x] of a random variable x is studied as a basis 
for the theory of remnants. This probability distribution 
function is the sum of f(y+7) over all integral values of 
r: y is nearly uniform (rectangular) in the interval [0, 1). 
The upper bound (d) of | g(y)—1]| is studied as is 
8 = | &)—-4 |, where 285<d. Using the Poisson sum- 
mation formula—that g(y) is the sum over r of 4(2mr)exp 
(—ry2mt) where ¢ is the cumulative function of x—it is 
shown that 6<d/2x. Moreover, if o, w are the scale and 
location parameters and T is (x—,)/o, o(d+1) is the sum 
over r. of fr(r/c) where fr is the probability distribution 
function of T. Both of these results assume that ¢ is real 
valued and non-negative. 

When x is normally distributed, computation shows that 
276=d whilst d=exp(—277so?) when o>4: the Sheppard 
correction result. Since this implies that &(y) is hardly 
affected by » or o and no control of these can thus affect 
G(y), attention is given to the case where o is small. It is 
shown that &(y) has two stationary values, a minimum at 
Ay > and a maximum at 1—A,># where A, = n—[pI], ie., 
the fractional part of p. 

Computation of A, is discussed and its value with corres- 
ponding values of &(y) and Var(y) are tabled: they are 
found to increase from zero at o = 0 to nearly rectangular 
values for c>0-4 (Table 2). It follows that a small decrease 


367 


AMATO, V: (University of Catania) 


Approximate relationships between the negative binomial and the I’ distribution—ZJn Italian 


Industria (1959) 3-16 (11 refs., 4 tables) 


An approximate relation between the negative binomial 
distribution and the gamma distribution is given whose use 
is justified when the parameters of the negative binomial 
assumes moderately high values. The author, starting from 
the agreement of the theoretical models to the empirical 
distributions, demonstrates that in the negative binomial 
‘when one of the two parameters is allowed to diverge, its 
limit is the gamma distribution. This relation is very 
useful since the values of the gamma distribution can easily 
be found in Pearson’s tables and the negative binomial or 
Poisson’s compound distribution is particularly suitable to 
describe situations where the average can vary from case 
to case. 


(P. Bandettini) 
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(of 2A9) in » can lead to a large increase in &(y)—from 
minimum to maximum. ‘Two tests based upon a sample 
of remnants are suggested for the hypothesis that a process 
producing articles of average length of an integer plus , 
is under control. The first uses the approximately normal 
probability distribution function of ¥y, and the second, 
dispensing with measurement, gauges the remnants against 
2X. If r pieces are found to be less than 2A, the normal 
approximation to the binomial is used. The second test 
is shown to be more powerful (Table 3). 

Other probability distribution functions of x are discussed 
e.g., Cauchy, log-normal and Pearson Type III, in so far 
as they entail modifications. Finally, the paper shows 
briefly how the theory may be extended to k-dimensional 
remnants: when 7 is multinomial the probability distribution 
function is obtained as an infinite series analogous to the 
univariate case. 


(D. E. Barton) 


2.5 (2.7) 


BODMER, W. F. 
¥. R. Statist. Soc. B (1959) 21, 134-143 (8 refs.) 


Any integer may be factorised uniquely as a product of 
powers of distinct prime factors. If a given integer has 
for factors @ primes to the («-+1)th power, b primes to the 
(8+1)th and so on (with «< B>...>7=1)—the rest being 
unit powers, then it is said to have the partitional character- 
istic O = («7f?...7”). Rényi [Acad.’ Serbe. Sci., Publ. Inst. 
Math. (1955) 8, 157-163] has shown that the proportion 
of integers up to x with a given QO tends to a fixed limit 
N(Q) as x—>co and that 1/677N(Q) is the sum (over all 
sets of w different primes) of fy(p1)x...xf,(pw) where w is 
the weight of O and f,(p) = 1/(p+1)p?". For all the 
primes up to an indefinitely large number, this is a sym- 
metric function of them and N(Q) is evaluated by expressing 
this as a sum of products of the generalisation of the power 
sums (i.e. the single sums of the products of the f’s taken 
all at the same argument) and replacing these by infinite 
sums over all primes. In fact any such sum is of the form 
U,; = Xp-"(1+p)- and these rapidly convergent series are 
the basic elements computed. 

The coefficients in the relations between the symmetric 
functions are bipartitional functions which are simple 
multiples of those of David and Kendall’s Table I (the 
symmetric functions here are non-augmented) and much 
of this paper is devoted to proving Sukhatme’s rules for 
their evaluation. 
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BURKHOLDER, D. L. (University of Illinois, Urbana, U.S.A.) 


» WwW (Dept. of Genetics, Cambridge, England) 
The limiting frequencies of integers with a given partitional characteristic—In English 


2.9 (0.4) 


Those values of N(Q) which are greater than zero on 
attaining five decimal places, are tabulated and compared 
with the observed frequencies in the integers 8446 to 8775. 


The proportion T,, (of integers with « = 7, a = s) is also 


computed as is the proportion F’, (with « = r) for the values 
of « contributing to the table of N(Q). Further, Rényi’s 
numbers d,, (the sum of N(Q) for a given w). Some relative 
frequencies conditional in a and « are given and Yule’s 
measure of association is computed giving, for instance, the 
association between integers divisible by third and fourth 
prime powers. The T7,, are expressed as a symmetric 
function (as is N(Q) which is evaluated in terms of single 
sums V,, which are sums over all primes of (p —1)/(p7"+ —p)*. 
This enables an independent check to be made on the totals 
of the N(Q)’s. Likewise the F, are the differences of the 
Riemann Zeta-function and so check the total of the T,;. 


(D. E, Barton) 


2.0 (4.6) 


On the existence of a best approximation of one distribution by another of a 


given type—In English 
Ann. Math. Statist. (1959) 30, 738-742. (1 ref.) 


Let F(«) and G(x) be two distribution functions and (a, b) 
an ordered pair of real numbers with a>0. The author 
investigates the problem of finding a pair (a, b) for which 
the distribution function F(ax+b) of the type of F(x) is 
close to G(x). The approximation of the distribution 
function and the optimal pair (a, bo) are defined by 


- postulating that 


sup | F(a9x +b») —G(«) | = M 
where M = inf sup | F(ax-+b)—G(x) |. In these expres- 
sions the supremum is to be taken over all real « while the 


infimum is taken over all positive a and all real 0. 
The author gives a simple example which shows that 


an optimal pair (ao, by) does not always exist. He derives 


then two sufficient conditions (Theorems 1 and 2) for the 
existence of an optimal pair. 

Theorem 1. Let A (with respect to B) be the set of all 
points such that 1/3 SF(x)S2/3 (with respect to 
1/3 <G(«) 2/3). If both sets A and B are non- 
degenerate, an optimal pair (with a)>0), satisfying 
the above postulate exists. 

Theorem 2. If F(x) is continuous while G(x) is a step 
function which has at most a finite number of dis- 
continuity points in each finite interval, then an 
optimal pair (ao, bo) exists. 
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Theorem 3 states that if the set of optimal pairs is non- 
empty then it is a convex set. This is used to describe 
the set of optimal pairs and to obtain a condition for 
the existence of a unique, optimal pair (a, bo). 

‘The results can be applied to the problem of estimating 
a scale and location parameter by the minimum distance 
method. 


(E. Lukacs) 
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CERTAINE, J. (Nuclear Devel. Corp. of America, White Plains, N.Y.) 
On sequences of pseudo-random numbers of maximal length—In English 


2.6 (11.5) 


J. Ass. Comp. Mach. (1958) 5, 353-356 (3 refs., 1 table) 


A widely used method for generating pseudo-random 
numbers on electronic computers consists of calculating 
successive powers (modulo m) of a number a. The number 
m is usually taken to be the power of the base of the system 
which is used to represent numbers to the computer; in 
general, m will be the largest power of 2 or 10 which will fit 
into the machine. 

This paper gives the length of the largest non-repeating 
sequence that is possible for any given m and gives a method 
for determining the number a which is required to yield 
this largest sequence. Table 1 contains generating numbers 
for some of the more widely used computing machines, 
including desk calculators. 


(D. Teichroew) 
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COX, D. R. & BRANDWOOD, L. (Birkbeck College, London) 


2.8 (1.8) 


On a discriminatory problem connected with the works of Plato—In English 
J. R. Statist. Soc. B (1959) 21, 195-200 (5 refs., 3 tables) 


Plato wrote five lesser works between the Republic and the 
Laws and this paper attempts to order them in time by 
means of the frequencies of the thirty-two different patterns 
of the last five (long or short) syllables of each sentence. 
The above mentioned major works differ markedly in respect 
to these patterns and each provide some 4000 examples so 
that their proportions are taken as being population values, 


‘the proportions in the lesser works being alone subject to 


random fluctuations (these yielding from 150 to 950 
sentences). General theory in respect of k characters (here 
the thirty-two patterns) is worked out, the frequencies 
being multinomially distributed with probabilities { 6,;}, 
y=0, 1; :=1, ...k; in the extremal populations {z, }. 
If {f;} are the relative frequencies in a lesser work, the 
discriminant is 5 = 2 f;s;, where s; = log ( :/80;). 

A model for the probabilities {0A,} is an intermediate 
population 7, is proposed (0S<AS1) and this has § as 
sufficient statistic for A or, equally, for its expected value 
¢(A), shown to be monotonic in A. The estimated standard 
error of § is computed for each of the five lesser works, each 
based on its own sample. The sentence endings are supposed 
to be statistically independent and each work homogeneous. 
The latter assumption is supported by internal comparisons 
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not given in this paper between chapters in the Republic 
and the Laws. The final ordering is Republic, Timaeus, 
Sophist, Critias, Politicus, Philebus, and Laws with various 
degrees of significance for the successive paired comparisons. 
It is pointed out that the ordering attained is not in accord 
with the views of the majority of classical scholars, although 
a minority group have reached similar conclusions by 
independent argument. 


(D. E. Barton) 


FUCHS, A. (Strasbourg University, France) 
On the inequality of Slutsky—In French 
Publ. Inst. Statist. Paris (1959) 8, 161-163 (1 ref.) 


In this short paper the author gives a simple proof of the 
Slutsky inequality (between moments) and compares it 
with other similar inequalities—one being deduced from 
the well-known relationship between the arithmetic and 
geometric averages. 


(Mlle. Gervaise) 
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HEUMANN, H. & NOBIS, E. (Kabelwerk Rheydt Akt.-Ges., Rheydt, Rheinland) 


2.1 (4.5) 


2.3) (lied) 


Calculation of the average and economic consumption of thermoplastic material for 
insulating cables and conducting wires—In German 


Metrika (1959) 2, 230-238 (4 refs., 1 table, 3 figs.) 


Most electric cables and metallic conducters are covered by 
insulating thermoplastic material. This insulating cover 
has to be of a certain defined minimum thickness: cores 
of a lower thickness must be rejected. In a manufacturing 
process carried out by extruders, variation of the wall 
thickness produced cannot be avoided. In a great number 
of measurements the thickness could be found to be normally 
distributed. For every machine and process the standard 
deviation of the thickness has a known value, which is 
unaffected by changes of the mean value. Adjusting the 
extruder to an average wall thickness that is smaller or 
only insignificantly greater than the specified limit results 
in the material consumption being high. ‘This is because 
plenty of cores are not sufficiently thick and must be rejected. 
They will be burned and prepared for a new process in 
order to save the valuable metallic wire. If the average 
wall thickness produced is significantly greater than the 
specified limit, the consumption of insulating material 
will also become undesirably high. In this case, the wall 
thickness of most cores is greater than needed. 

This paper describes a method for calculating that 
value of the mean which minimises the consumption and 
loss of material. In addition to the two terms representing 


a4 


the losses caused by too small or too great differences 
between the mean and the specified limit, the equation for 
all losses contains a third term as an equivalent for the 
costs of burning and preparing the rejected cores. They 
are independent of the diameter of the wires but proportional 
to the number of cores. The total loss function is minimised. 
The result is found by a graphical solution of a transcendental 
equation. It is believed that the method discussed in this 
paper could be successfully applied to other industries. 


(H. Schauerte) 


‘sy 
We an z 
fod Ee) 
Sty cai 
a cakes 
ce 


JOHNSON, N. L. (University College, London) 


Pape ey 


On an extension of the connection between Poisson and x? distributions—In English 


Biometrika (1959) 46, 352-363 (11 refs., 3 tables) 


An extension, connecting the non-central x? distribution 
with the distribution of the difference between two in- 
dependent Poisson variables, is discussed with the purpose 
of approximating the latter distribution by an approximation 
of the non-central x? distribution. A similar approximation 
to the difference between a negative binomial variable and 
an independent Poisson variable is found by an approxima- 
tion of the non-central F distribution. 

In the case of the non-central x? distribution with even 
degrees of freedom, the Poisson probability may be evaluated 
using a Bessel function. If neither of the Poisson para- 
meters is stnall, Johnson proposes using a normal distribution 
to approximate the distribution of the difference of the 
Poisson variables: he later modifies this for skewness. 
The accuracy of the approximation is expected to increase 
with the non-centrality parameter A, Patnaik’s first non- 
central x? approximation [Biometrika (1949) 36, 202-232] 
being more accurate when A is small. This is shown in a 
table for comparison of known approximations to the upper 
and lower 5 per cent. points of the non-central y? distribu- 
tion. The type of variation, however, is not quite as 
anticipated. Patnaik’s approximation to the normal dis- 
tribution is compared with Johnson’s, and when the upper 
limit of the probability integral is large, Johnson’s approxi- 
mation gives values of this limit larger than Patnaik’s. 

The non-central F distribution is treated in a similar 
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KAO, J. H. K. (Cornell University, Ithaca, N.Y.) 


way and an analogue of the normal approximation to the 
non-central y” is given with some modification for skewness. 

A short discussion is given of the two cases (Poisson 
variable)—(negative binomial variable) and (negative bi- 
nomial variable)—(negative binomial variable) and their 
correspondence to the component of variance model in 
analysis of variance. ‘The distribution of FP”, where F’” is 
the ratio of two non-central x? distributions divided by 
their degrees of freedom, is given as a mixture of negative 
binomial distributions. 

The remainder of the paper concerns the applications 
of the concept of a “‘ mixture”’ to non-central xy”, negative 
binomial and non-central F distributions. 


(Barbara Saw) 


Editorial Note: see also abstract No. 387, 3.2. 
Pearson 


2.5 (4.3) 


The mixed Weibull distribution in life-testing of electron tubes—In English 
Nat. Conv. Trans., Amer. Soc. Qual. Contr. (1959) 267-289 (8 refs., 2 tables, 8 figs.) 


In broad terms electron tube failures can be classified into 
two types: (i) catastrophic or sudden failures and (ii) 
wear-out or delayed failures. A tube which is operating 
normally and then suddenly becomes inoperative is defined 
as a catastrophic failure. Since, in general, these failures 
are of a mechanical nature, they may appear as soon as the 
tube is put into operation; the failure rate due to catastrophic 
failures decreases with time. A tube whose quality char- 
acteristics have gradually fallen below given specifications 
is defined as a wear-out failure. Such failures are generally 
of a chemical or electrical nature. For most applications, 
very few wear-out failures occur until some particular time 
period has elapsed, after which the failure rate increases 
with time. 

For the time to failure, x, the proposed distribution 
functions for catastrophic and wear-out failures are, 
respectively, 


1 te aB sles for x=0 
F(x) = i for x0 

1 —e—(a—9) Bales for x2y2 
F(x) = to for xSy2 
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where «, %>0, 0< £,<1, B.>1, and y,20. The distribu- 
tion function of the life time of electron tubes is 


F(x) = pFy(x) + (1 —p)Fo(x) 


where p is the proportion of catastrophic failures. 
The failure rate, 2(x) is defined as 


2(x) = F'(x)/[1 —F(@)]. 


The mixed distribution function, F(x) is useful in describing 
the frequently observed situation of an initial high failure 
rate, followed by a decreasing rate and then an increasing 
rate. The initial failures are attributable to catastrophic 
failures and the later failures to wear-out. 

Graphical methods are given for estimating the para- 
meters %, B;, %, Bo, y, and p. ‘The construction of a 
confidence band for F(x) is also described. 


(D. W. Gaylor) 


KIMBALL, A. W. & LEACH, E. 


(Oak Ridge National Lab., Tennessee) 


2.2 (0.4) 


Approximate linearisation of the incomplete B-function—In English 


Biometrika (1959) 46, 214-218 (7 refs., 1 table) 


Karl Pearson pointed out that the sum of a number of 
binomial terms is equal to the incomplete beta-function 
ratio. ‘The authors are concerned with a quantity M, the 
probability of death from a dose D of radiation where 
M = I,(p, g) and x is a function of D. Pearson’s nomen- 
clature, which the authors retain, is unfortunate in this 
particular connection since x is equal to the p of the common 
binomial parlance. The authors require an appropriate 
solution of the equation M = «. They remark that, from 
observations on M and D, regression estimates can be 
obtained, but that the use of J,(p, g) as an expected value 
presents grave difficulties. They sought, therefore, for a 
transformation which would give a linear relationship 
between functions of the dependent and independent 
variables. 

Scheffé and Tukey gave an approximate solution to 
M = « in terms of the « percentage point of the y? distribu- 
tion with 2q degrees of freedom. Since this approximation 
cannot be expected to serve over the whole range of x, 
the authors give their modification of it. (An excellent 
approximation due to Tukey did not seem appropriate 
since the regression could not be made linear). The Wilson- 
Hilferty cube-root transformation for x?/2q is applied to 
the x? of the Scheffé-Tukey transformation. The authors 
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LAHA, R. G. (Catholic University of America, Washington) 


On the laws of Cauchy and Gauss—IJn English 
Ann. Math. Statist. (1959) 30, 1165-1174 (8 refs.) 


Let x and y be two independently distributed normal 
random variables with the zero mean and common variance. 
It is then known that the quotient x/y is distributed according 
to Cauchy’s law (with median zero). It is of interest to 
investigate whether the converse statement holds, since this 
could yield tests of normality. However, the converse is 
in general not true. Counter examples were given by the 

~ author [Proc. Nat. Acad. Sct. U.S.A. (1958) 44, 222-223], 

by Steck [Ann. Math. Statist. (1958) 29, 310-312], and by 

Mauldon [Quart. ¥. Math. (1956) 7, 155-160]. 

' In the present paper the author proves first the following 

theorem: 


Theorem 1. Let x and y be two independently and 
identically distributed random variables with common 
distribution function F(x). Suppose that the quotient 
w = x/y has a Cauchy distribution with zero median. 
Then F(x) is symmetric about x = 0 and is absolutely 
continuous with a density function f(x) = F’(x)>0. 
Moreover, x has an unbounded range and f(x) satisfies 
for all u the relation 


ao 
| f(x) f(ux)x dx = co/(1-+n*) where Co is a constant. 
0 = 
The author gives then a characterisation of the normal 


distribution. 


378 


accordingly finish with an approximation to the incomplete 
beta-function ratio which is an incomplete integral of a 
unit normal curve. The upper bound of the integral is 
cumbersome but the results are close to the exact values 
as may be seen from the tables. 

Tukey’s approximation is also given in the table. It 
never appears worse than the one suggested by the authors 
and is often better. A simplification of the authors own 
approximation is also given. 


(Florence N. David) 


2.15(2)3) 


Theorem 2. Suppose that the assumptions of Theorem 1, 
and also the following two conditions are satisfied 


(i) F(x) has finite moments of all order, 
(i) Ge) = explrz In| ~ |] has no zeros in its region 
of regularity (z complex). 
Then F(x) is a normal distribution. 


The author has continued this study in a paper “‘ On a 
class of distribution functions where the quotient follows 
the Cauchy law” [Trans. Amer. Math. Soc. (1959) 93, 
105-215]. 


(E. Lukacs) 


LAUBSCHER, N. F, (National Physical Research Lab., Pretoria) 
A note on Fisher’s transformation of the correlation coeffi 


J. R. Statist. Soc. B (1959) 21, 409-410 (2 refs.) 


Fisher’s tanh transformation for normalising the distribu- 
tion of the correlation coefficient, 7, is well known. Other 
transformations have been suggested, notably perhaps the 
sin transformation by Harley. The author considers 
whether, instead of taking log{(1 +r)/(1—r)}, it will be 
possible to obtain a transformed variance independent of 
p by taking log{(c,+r)/(c,—r)} and choosing c, and ¢, 
appropriately. He investigates this by the usual moment 
expansion of a Taylor series and concludes that c, = Coal 
in Fisher’s transformation is the optimum for that trans- 
formation. 


(Florence N. David) 
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MULLER, M. E. (Princeton University) 


A comparison of methods for generating normal deviates on digital computers—In English 


2.2 (6.2) 


cient—In English 


2.3 (1.5) 


F. Ass. Comp. Mach. (1959) 6, 376-383 (19 refs., 2 tables) 


A number of procedures have been suggested for transform- 
ing one or more uniform variates into a normal deviate on 
a digital computer. ‘This paper describes the following six 
procedures. 
(i) Direct Approach. Starting with two independent 
uniform variates, U, and U;, on the interval (0, 1) let 
X, = (—2 In U,)/*Cos 22 U;; 
X, = (—2 In U,)/?Sin 27 Ui. 
Then X, and X, are normally and independently 
distributed (0, 1). 
(ii) Inverse Approach. Solve X = X(U) in the equation 
« 
Ol 12x | e-#l2 dt, OSXTUA1. 
—0 

Over most of the interval (0, 1) for U, X = X(U) 

is approximated by Tchebysheff-type polynomials. 

As X increases, the degree of the polynomial is 

increased. For 127/128<US<1, a truncated con- 

tinued fraction expansion is used. This method 

was previously described by the author [7. Ass. 

Comp. Mach. (1958) 12, 167-174]. 

n 


(iii) Central Limit Approach. @= < U;, approaches 
i=1 


i) 
normality. This is studied for n = 12. 
(iv) Rejection Approach. Consider U, and U, as above 
and —b=X<b. Compute Y = —2b2(U,—0°5)?. 
If In U,SY, X = 6(2U,—1) is used as a normal 
deviate; otherwise reject the pair. 
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(v) Hastings’ Approach. 
2 3 
XA ae Tr | 2 agi] = ba 
CW) 41=0 
where 47 = —2Inqg and q=1—U([X@)], as in 
(2); 0<qS0-5. Values of a; and 6; are given. 


Teichroew’s Approach. 6 is used, except the range 
is restricted; for nm = 12, 2<@<10, where 


Prob[2 =@=10])221—2(10)>. 
For n = 12, let r = (6—6)/4. Then 


(vi) 


4 
DG PHT ny ctl ee, 
: i=0 
The values of a are given for this case. ‘They are 
based on a fitted Tchebysheff polynomial: the 
details are given -in a doctoral thesis at North 
Carolina State College. 


A brief critique is given of each approach. Comparisons 
were made on an IBM 704 on the basis of computing time 
per deviate, precision and memory space required. ‘The 
inverse approach is much faster and the rejection approach 
much slower than the other four approaches. The Central 
Limit and Teichroew approaches require the least amount 
of memory space and the direct and inverse approaches 
by far the most. The direct and rejection methods are the 
most precise and the Hastings and inverse approach are 
least precise; Teichroew’s method is not good for deviates 
on the extreme tail. 


(R. L. Anderson) 


WILKS, S. S. (Princeton University) 
Recurrence of extreme observations—In English 
JF. Aust. Math. Soc. (1959) 1, 106-112. 


This paper considers independent observations being drawn 
from a population with continuous distribution function 
F(x). Let y be the rth largest in the first m observations, 
and m the number of further observations required for k 
of these further observations to exceed y. By elementary 
methods involving the known distribution of F(y), the 
negative binomial distribution of n given Fy), and integration 
over F(y), it is shown that the probability function of n is 
given by 


_ (n—1\ [m—1)\ (m--n—1\ & 
‘aa aes 1) ee Ween Ore 
(8 SS 18s IESE is Ben 


independently of the functional form of F(x). The mean 
of this distribution is mk(r—1); in the case of k = 1, 
its variance is mr(m—r+1)(r—1)-2(7—1)+. The limiting 
distribution for large values of m of the ratio z = n/m is 
determined; if k = 1, this distribution has density function 
rl =2) 4) 2>0: 

The same ~(n) with r = 7,+72(<m) gives the probability 
that 2 further observations will be required for k of these 
further observations to lie outside the interval between the 
7,th from smallest and the 7th from largest of the first m 
observations. 


381 


382 


2.6 (5.6) 


Some numerical results are given in connection with 
predicting exceedances of the extremes obtained in a pre- 
liminary sample; for example, if m is large, the probability 
is approximately 0-95 that fewer than 3-47m further observa- 
tions will be required for an observation to lie outside the 
interval between the smallest and largest of the first m 
observations; in the same situation, the probability is 
approximately 0-95 that more than m/38 observations will 
be required. 


(E. W. Bowen) 


EEDEN, CONSTANCE VAN. (Mathematical Centre, Amsterdam) 
Some approximations to the percentage points of the non-central t-distribution—Jn English 


Statist. Dept., Math. Centre (1960), Report $242. 


In this paper the following approximations to the percentage 
points of the non-central t-distribution are considered: 


(i) Three closely related approximations based on a 
normal approximation to the distribution of z-+k y/x?, 
where 2 possesses a N(0, 1)-distribution and 
xa x°-distribution with f degrees of freedom. 
Two of these approximations were derived by 
Jennet & Welch [Suppl: $. R. Statist. Soc. (1940) 
7, 80-88] and by Johnson & Welch [Biometrika 
(1940) 31, 362-389] respectively. The third one 
was derived by Hendricks [Ann. Math. Statist. 
(1936) 7, 210-221] for the percentage points of the 
central ¢-distribution and could be generalised for 
the non-central t-distribution. 

(ii) Four closely related approximations based on a 
method used by Goldberg & Levine [Ann. Math. 
Statist. (1946), 17, 216-225] for the central t- 
distribution. In this case the percentage points 
are expressed in a power series in 1/f, the coefficients 
being functions of a N(0, 1)-variable. 

(iii) ‘The approximation of Merrington & E. S. Pearson 
[Biometrika (1958) 45, 484-491] based on a Pearson 
type IV—approximation to the non-central t- 

. distribution. 

(iv) The approximation of Harley [Biometrika (1957) 
44, 219-224] based on an approximation of the 
distribution of a function of t by the distribution 
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HARTER, H. L. (Wright Air Development Center, Dayton, Ohio) 


3.2°(11-1) 


of the correlation coefficient in samples from a 
bivariate normal distribution. 

(v) An approximation described in Owen’s table of 
factors for one-sided tolerance limits for a normal 
distribution (Sandia Corporation Monograph, 1958). 
This approximation is based on an (approximate) 
relation between one- and two-sided tolerance 
limits. 

(vi) An approximation based on a normal approximation 
to the non-central t-distribution. 

These approximations are compared numerically with 
the exact percentage points for several values of the non- 
centrality parameters 5, the probability « and the number 
of degrees of freedom f. Several tables are given to make 
this comparison possible. For the central ¢-distribution 
the approximations are especially important for large values | 
of f, where tables are not available. Then one of the 
approximations (ii) is the best one: it is identical with the 
exact percentage points in three decimal places for 
0-50<«<0-99. For the non-central t-distribution the 
values f = 2(1)9 were used. For these values of f the 
approximations (i) and (ii) are the most important ones. 
In some cases (iii) or (iv) may be better, but for these 
approximations more computational work is needed. The 
approximations (v) and (vi) are not recommended; (vi) 
gives only reasonable results for the central t-distribution. 

Further, the report contains a description of the exact 
tables of the non-central t-distribution. 


(Constance van Eeden) 


3.8 (4.3) 


The use of sample quasi-ranges in estimating population standard deviation—In English 


Ann. Math. Statist. (1959) 30, 980-999 (21 refs., 6 tables, 1 fig.) 


The use of sample quasi-ranges in estimating population 
* standard deviation is discussed, and a number of tables are 
provided for cases of interest. Section 1 deals with the 
normal population, and Section 2 with rectangular and 
exponential populations. 

The rth quasi-range, w,, of a sample of size n is defined 
as the range of the (n—2r) sample values remaining after 
removal of the r smallest and 7 largest; thus w, stands for 
the range of the whole sample. 


Section 1 which deals with the normal population 
N(u, 1), begins with a tabulation of E(w,) in Table 1. This 
table gives E(w,), accurate to within a unit in the sixth 
decimal place, for r = 0(1)8 and n = (2r+2)(1)100. Table 2 
presents var (w,), accurate to within a unit in the fifth 
decimal place, for r = 0(1)8 and m = (2r+2)(1)100. 

The second table is concerned with efficiency of estimates, 
expressed in terms of 6 = s/c, where s is the sample standard 
deviation and c, is the correction factor to provide un- 
biasedness. The efficiency of an estimate @ is given by 
eff G = var 6/varG. Table 3 shows the efficiency of the 
unbiased estimate w,/E(w,) for r = 0(1)8 and n = (2r +2)(2) 
- 50(5)100, accurate to within 0:01 per cent. Table 4 compares 
_ for each n = 4(1)100, the efficiencies of the most efficient 
estimate based on one sample quasi-range, the most efficient 
estimate based on linear combinations of two adjacent 
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quasi-ranges, and the most efficient estimates based on two 
quasi-ranges w, and w, with r<r’S8; each estimate is 
given, along with its efficiency. Comparisons with Grubbs- 
Weaver estimates are made in Section 1.6. 


Section 2 deals with non-normal populations. For the 
standard (zero mean and unit variance) rectangular popula- 
tion, the unbiased estimate of o based on the range is given 
in Table 5 for 2 = 2(1)100; the numerical coefficients on 
W are accurate to within a unit in the sixth decimal place. 

For the standard (unit mean and variance) exponential 
population the most efficient estimates (in comparison with 
the efficient estimate %) based on quasi-ranges were found 
to have efficiencies varying from 50-00 per cent. to 61:73 per 
cent., and so were not tabulated. It is stated that investiga- 
tion of efficiencies of estimates based on single order statistics 
has begun. 

Finally, Section 2.3 considers the bias involved if 
normality is assumed when the population actually is 
exponential or rectangular. ‘Table 6 shows these biases, 
for n = 2(1)100, for the various estimates tabulated in 
Table 4. 


(R. Hooke) 


KUIPER, N. H. (Landbouwhogeschool, Wageningen) 3.0 (2.0) 
‘On the random cumulative frequency function—In English 
Proc. Kon. Ned. Akad,, Wetensch. A (1960) 63 
Indag. Math. (1960) 22, 32-37 (6 refs., 2 figs.) 


Two random variables are called “‘ isomorous ”’ if they have (b) The interval [0, 1] can be mapped into the reals 
the same cumulative frequency function. If the independent modulo one, on which operates the natural group of 
mutually isomorous random variables x,, ..., X, are isomorous translations. In terms of the reals modulo 1, 
to a given continuous random variable y, with cumulative theorems 1 and 2 are then deduced from the in- 
distribution function F(x), and if Fy(«) is the cumulation variance properties under this group. 
frequency function of the random sequence W = (xj, ..., Xn), 
then (N. Kuiper) 

Theorem 1. (Gnedenko-Mihalavicz) u = dF (x) 

Py (2) > F(a) 


is a uniform random variable on [0, 1], and 


Theorem 2. (Birnbaum, Pyke). The optimal value ont 
of the random function Fy(x)—F(x) (that is the 
[random] value x for which the minimum is obtained) 
is such that F(xop:) = U is a uniform random variable 
on [0, 1]. 

In this paper the author gives a simple proof of these 

two theorems, which is based on two ideas: 


(a) The results are independent of the continuous 
random variable y (hence independent of the function 
F(t). Then he restricts to the case that y is uniform 
on [0, 1]. 
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PACHARES, J. (Hughes Aircraft Co., Culver City, California) 3:3) Clie) 
Table of the upper 10 per cent. points of the studentised range—In English | 
Biometrika (1959 46, 461-466 (4 refs., 3 tables) 4 


Let w be the sample size of 7 values sampled from a normal 
population, and s be a sample estimate of the standard 
deviation based on v degrees of freedom and distributed 
independently of w. The author gives to three significant 
figures, a new table of the upper 10 per cent. points of the 
studentised range, g = w/s for nm = 2(1)20 and v = 1(1)20, 
24, 30, 40, 60, 120, oo. Details are given of the formulae 
used and of the method of computation which was carried 
put on a IBM 704 computer. 
For checking purposes the author recomputed the upper 
5 per cent. points already given in Table 29 of Pearson & 
Hartley’s Biometrika Tables for Statisticians 1, (1954) and 
discovered several minor errors in their table. In addition 
the editors of Biometrika were able to correct some similar 
minor errors in their table of the upper 1 per cent. points 
from some highly accurate, but as yet unpublished, calcula- 
tions by Harter. The corrected tables of both the upper 
5 per cent. and 1 per cent. points, given to two decimal 
places, and for the same values of n and vy, are therefore 
included with the author’s new table of the upper 10 per 
cent. points. 


(D. Harris) 


e ae ae vty Sa ee Ee et eh 


PEARSON, E. S. (University College, London) | 3.2 (2.1) 


Note on an approximation to the distribution of non-central x’—In English 
Biometrika (1959) 46, 364. 


In connection with Johnson’s paper [Biometrika (1959) 46, 
352-363], this note briefly sets out some details of another 
approximation to the distribution of non-central x2. This 
particular approximation used a x? distribution with the 
correct mean, standard deviation and skewness factor (8) 
but not necessarily exactly the correct starting place. These 
conditions appear suitable when the curve does not rise 
sharply at the lower end. 
The errors between the exact values and upper and 
lower 5 per cent. points of the approximation are given in 
column (v) of Table 1 in Johnson’s paper. The conclusion 
is that “‘ Taken as a whole, it is clear that for the particular 
values of the parameters selected, the three-moment ? 
approximation is better than the other approximations ’’. | 
It is emphasised by the author that the search for accurate | 
approximations to non-central statistics was not the main 
purpose of Johnson’s paper. 


(Barbara Saw) 


Editorial Note: see also abstract No. 375, 2.5. 
Johnson 
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PILLAI, K. C. S. (Statistical Center, University of the Philippines) 3.3 (11.1) 
Upper percentage points of the extreme studentised deviate from the sample mean— 
In English 
Biometrika (1959) 46, 473-474 (4 refs., 1 table) 


Let x, Sx, S... <x, be an ordered sample of size n from a 
normal population with variance o”, the mean of the sample 
%, and s, the square root of an independent mean square 
estimate of o? based on v degrees of freedom. Then the 
extreme studentised deviate from the sample mean is 
- defined as ty, = (%,—)/s, or (¥—x;)/s,.. The upper 5 per 
cent. and 1 per cent. points of t, are given in a new table 
for n = 2(1) 10, 12 and vy = 1(1) 10. The method of prepara- 
tion of this table is discussed. 


(D. Harris) 


Editorial Note: see also abstract No. 389, 3.3. 
Pillai & Tienzo 
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PILLAI, K. C. S. & TIENZO, B. P. (Statistical Center, University of the Philippines) 


3.3 (3.8) 


On the distribution of the extreme studentised deviate from the sample mean—ZIn English 


Biometrika (1959) 46, 467-472 (11 refs.) 


Let x; Sx.S... Sx, be an ordered sample of size n from 
a normal population with variance o?. The sample mean 
is * and s, is the square root of an independent mean square 
estimate of o? based on v degrees of freedom. 

Let (%—x,)/o 
CP 
The distribution of u has previously been derived by several 
authors [e.g. David, H. A., Biometrika (1956) 43, 449] and 
probability integrals given for nm ranging from 2 to 25. 
Percentage points of t,, the studentised extreme deviate 
from the sample mean, have also been obtained for sample 
sizes n = 3(1) 10, 12 and some values of v =10. 

The present paper develops the distributions of u and 
of t in series form, for n = 3, 4, 5 and vX<10. Percentage 
points of t, derived from formulae given in this paper are 
incorporated in the more extensive tables of the 5 per cent. 
and 1 per cent. points given by Pillai [Biometrika (1959) 46, 
473-474]. 

The authors point out that percentage points for v<10 
are particularly useful in many situations, for example in 
experiments with small numbers of treatments. This is 
illustrated by an example of a randomised block experiment 


u = (%,—x)/o or 
tn = (%,—x)/s, or 
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with five treatments and three blocks, in which the test is 
applied to the difference between the largest treatment 
mean and the general mean. 


(D. Harris.) 


Editorial Note: see also abstract No. 388, 3.3. 


3.1 (2.4) 


An elementary proof of the theorem on which the x? method is based—IJn German 


Metrika (1959) 2, 89-93 (3 refs.) 


For the theorem that, given a multinomial distribution with 
k classes, the chi-square statistic follows asymptotically the 
chi-square distribution with k—1 degrees of freedom, the 
proof of K. Pearson [Phil. Mag (1901) 50, 157-172] and the 
proof using the characteristic function [Cramér, Mathe- 
matical Methods of Statistics (1945)] are well known. 

Morgenstern [Metrika (1958) 1, 239-241] gives a more 
elementary proof based on complete induction from k to 
k+1. For k = 2 he uses the asymptotic normality of the 
binomial distribution. For k+1 he splits up the statistics 
into three parts. The first follows the chi-square distribu- 
tion with k—1 degrees of freedom, the second is asymptotic- 
ally normal and the third vanishes asymptotically. 

This paper gives another approach using the asymptotic 
normality of the Poisson distribution. ‘The conditioned 
joint distribution of k random variables is the multinomial 
distribution if each of k random variables follows a Poisson 
distribution with the parameter mp; (2p; = 1) subject to 
the restriction that the sum of the & variables is . The 
distribution of each random variable, and also the sum of 
the & variables lacking the restriction about the sum, con- 
verge with increasing m to a normal distribution. The 
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conditioned asymptotic distribution, again subject to the 
restriction that the sum is n, is a R—1 dimensional normal 
distribution: this leads to the theorem. 

It should be mentioned that the same idea is stated by 
Sir Ronald Fisher [¥. R. Statist. Soc. (1922) 85, 87-94] 
and is also considered in the expository paper by Cochran 


[Ann. Math. Statist. (1953) 24, 315-345]. 7 


(E. Walter) 


‘WATSON, G. S. (Australian National University, Canberra) 
Some recent results in chi-square goodness-of-fit tests—In English 


Biometrics (1959) 15, 440-468 (25 refs., 1 table) 


The author begins by considering the effect of estimation 
of parameters on the distribution of Pearson’s X® criterion 
of goodness of fit. Frequently the estimates are obtained 
by maximum likelihood or some asymptotically equivalent 
method applied to the data before the grouping necessary 
for the chi-square test is made. The author points out 
that this leads to a distribution which, even asymptotically, 
is not x” but differs from x? with the usual degrees of freedom 
by a positive definite quadratic form involving the unknown 
parameters. Hence, even though the correct estimators 
(i.e., best asymptotically normal estimators based on cell 
frequencies) may be troublesome to obtain, the author 
recommends their use and suggests that they may be found 
by an iterative solution of the appropriate (e.g., maximum 
likelihood) equations (with the simpler estimators as starting 
values) with less difficulty than that required for an assess- 
ment of the effects of using alternative estimators. 

The disturbing effect of incorrect estimation can be 
partitioned out, an operation described as being equivalent 
to calculating X? with the estimators Which result from one 
cycle of iteration to the maximum-likelihood equations. 
This appears to be satisfactory since, in large samples, one 
iteration should be sufficient. 

These results, which are stated in the introductory 
section, are demonstrated in Section 2 of the paper, as well 
as the relation of the effect of estimation on chi-square and 
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Standardised t—In English 
Nature (1960) 185, 558. 


Students t may be standardised by expressing it in units 
of its own standard deviation; thus, t, = tV/{(v—2)/v}, 
where v is the number of degrees of freedom. The procedure 
reduces the variability of the tails with v for a given probability 
P, and in particular for P = 0:05 and 0 2v=4, ¢, lies 
- between 1:96 and 2-0 whereas t lies between 1:96 and 2-776. 
The finding extends the applicability of certain “large” 
_ sample methods down to 4 degrees of freedom. 

The standardising is extremely simple to apply—merely 
divide the error sum of squares by v—2 instead of by v 
and use this value to calculate the appropriate standard 
error (SE). For example, if d is the difference between 
two means and d=2 SE, PS0-05, that is the difference is 
_ significant at the 5 per cent. level. If d<1-96 SE, P>0-05 
and the difference is not significant. Between these two 
limits of d, P ~ 0-05; actually 0:0535>P>0-0455. Again, 
+2 SE will give a close approximation to 95 per cent. 
confidence limits (actually between 95 and 95-45 per cent.). 
When the formula is applied to the correlation coefficient, 
r, we find that 7 is significant at the 5 per cent. point if 


r>2/V(v+2). 


(J. B. de V. Weir) 
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its partitions to Neyman’s smooth goodness-of-fit tests. 
It is stated that the theory of Section 2 is adequate to deal 
with 

(a) true multinomial distributions, 

(6) grouped discrete distributions, and 

(c) continuous distributions grouped by using intervals 

which are fixed for all samples. 

In the third section, finite samples from the multinomial 
are discussed. It is shown that, as the sample size increases, 
X* tends to the chi-square distribution obtained uncon- 
ditionally in the limit. A table is given to make a comparison 
of three X* statistics for small sample situations. Continuous 
distributions are treated in Section 4. An argument is 
presented to justify the intuitive procedure in large sample 
situations of ignoring the manner in which the class intervals 
were found, as long as the method of their construction is 
“fair”. ‘The final section of the paper treats two general 
methods for partitioning X?. 

The author’s conclusion is that ‘‘. . . The theory and 
application of X? to true multinomial situations leaves little 
to be desired. For continuous distributions, the approach 
is less natural although . . . it may be made to solve the 
main problems satisfactorily.” 


(R. H. Brown) 


3.1 (—.-) 


ZIA-UD-DIN, M. (Punjab University, Lahore) 
The expression of k-statistic k,, in terms of power sums and sample 
moments—In English 
Ann. Math. Statist. (1959) 30, 825-828 (7 refs.) 


With the help of tables by Abdel-Aty [Biometrika (1954) 41, 
253-263] the author gives in this note the expression for 
the k-statistic kj, in terms of the power sums 
n 
Sa aie 
4=1 
In a previous note [Ann. Math. Statist. (1954) 25, 800-803] 
the author found the corresponding expression for ky and 


Ryo. 


(R. Hooke) 
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AZORIN, F. (Central University of Venezuela, Caracas) 


4.3 (6.1) 


Statistical considerations on the Cobb-Douglas function—In Spanish 


Estadistica Espafiola (1959) 4, 19-29 (9 refs.) 


This paper is concerned with the so called Cobb-Douglas 
production function. Not with its general interpretation 
or justification but with the estimation of parameters. After 
a brief introduction the author considers the fitting of an 
hyper-surface to the exponential function in which k 
independent variables represent inputs, whereas the 
dependent variable y is to be taken as output. 

Supposing a simple random sample of size n in each of 
k+1 groups, in which each group represents a variable or 
productive element plus a product quantity, the observations 
being made in particular firms or industries. The para- 
meters of the fitted exponential function have to be estimated. 

For the sake of simplicity, the case is first considered 
as being of one dimension. The two parameters are 
estimated by the least squares method, the result being 
maximum likelihood estimates in the general linear hypothesis. 
For practical reasons the fitting of an hyper-plane is made 
directly to the logarithms of the observations. 

Now, special attention is given to characteristics which 
can be defined as the marginal productivity, obtained from 
the previously estimated parameters. There would no be 
difficulty if the variable x and the estimated y might be 
considered as constants. But ¥ is really an estimator which 
takes different values in the possible sarmples of size n. 
A method due to Carter and Hartley consists in an approxima- 
tion for the calculation of the variance of the estimated 
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marginal productivity. Extending the results to the k- 
dimensional case one has a matrix expression of the above 
variance. 

However, the estimator of the variance of the marginal 
productivity is not unbiased. In order to obtain an estima- 
tion of that bias, the author observes that it can be con- 
sidered as a product of a constant into b/(x). 

Now the mathematical expectation of the above expression 
can be written as a sum of two terms, the second of which 
may be expanded in a series of linear functions of moments. 
Finally, dividing by x, an expression of the bias of the 
marginal productivity is obtained in terms of x, y, the 
logarithms of the observations, the population regression 
coefficients and the variance of the logarithm of the pro- 
duction. A first approximation is obtained when the 
mathematical expectation of the estimation of y is approxi- 
mately equal to y. 

When the distribution of the logarithm of the estimated 
production is normal, the moments of odd order vanish 
and a simpler expression of the relative bias is given. 

It is further shown that the bias is not negligible relative 
to the standard error of the marginal productivity. Finally, 
it is pointed out how to increase the accuracy by decreasing 
the bias and not only through a large precision of the 
estimate. 


(S. Rios) 


4,4 (2.5) 


Confidence limits in the case of the geometric distribution—In English 
Biometrika (1959) 46, 260-264 (4 refs., 2 tables, 2 charts) 


The author is concerned with a problem in life-testing. 
X is the number of trials and m is its expected number. 
X has a distribution called here the geometric distribution 
although it is sometimes also called the sequential Poisson 
or the sequential binomial. Suppose WN articles each life- 
tested with the 7th item requiring X;, trials before failure. 
A sufficient estimator for m is the mean of the x’s. Also the 
sum of the X’s is distributed as a negative binomial. Thus 
for a given m and N, limits can be found for X, and con- 
sequently for m for given N and X. 

The charts were computed on a IBM computer and the 
author remarks that the programme will be kept available. 
The charts are in agreement with the work of Anscombe 
if the extra corrective terms are used where necessary. 
Examples are given and interpolation and extrapolation into 
the charts are discussed. 


(Florence N. David) 
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(Nat. Bur. Stand., Colorado and U.S. Naval Ord. Test Sta., 


4.4 (11.1) 


Confidence intervals for the expectation of a Poisson variable—JIn English 
Biometrika (1959) 46, 441-453, (14 refs., 4 tables, 2 figs.) 


A table of optimum two-sided confidence intervals is given 
for confidence coefficients 0:80, 0-90, 0:95, 0:99 and 0-999 
and all values of the variable from 0 to 300. For a given 
confidence coefficient, the confidence belt thus tabulated is 
determined by the properties: 


(i) that of all confidence belts for the Poisson expectation 
which are based directly on the Poisson variable c, 
it is as narrow as possible in the c-direction, 

(ii) that of all such equally narrow confidence belts it 
has the smallest possible upper confidence limits. 


These give a system of the same type as that tabulated 
by Crow [Biometrika (1956) 43, 437-432] for the parameter 
of the binomial distribution. Reference should be made 
to this earlier paper for full details of definitions and calcula- 
tions. A property of Poisson probability sums which is 
discussed in the present paper resulted in simplification of 
the calculations, most of which were carried out on a high- 
speed electronic computer. 

Denote by 5, the system described above, by 5, the system 
of central confidence intervals given by Pearson and Hartley 
[Biometrika Tables for Statisticians 1, (1954)], by 53 the 
system having the property (i) above, defined by a condition 
due to Sterne [Biometrika (1954) 41, 275-278] and by 6; 
the shortest unbiased central system (obtained by adding 
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Combining unbiased estimators—In English 
Biometrics (1959) 15, 543-550. (3 refs.) 


The problem considered is that of combining linearly two 
independent unbiased estimators &,, @, of a parameter & 
when the ratio of the variances or, oF of &, & is unknown. 
The combined estimator @ is termed by the authors a 
‘uniformly better unbiased estimator” of @ (than &, &2) 
if its variance is less than or equal to the smaller of as 
o% for all values of o7, 03. It is well known that no linear 
combination with constant weights can be uniformly better 
but under certain circumstances it is possible to find random 
weights which achieve this. 

An important special case is studied and the following 
main result obtained. Let x, y be normal variates with 
common mean p and unknown variances o;/m, oz/n2; let 
m,S%/0%, m2s3/03 be distributed as x? with m,, m, degrees of 
freedom and suppose all random variables are independent. 
Then a necessary and sufficient condition that 

f= (mys5x +nzsty)| (155 +1387) 
is an unbiased estimator of f which is uniformly better 
than either x or y is that m, and mz are both larger than 9. 

An application of the theorem is to the recovery of 
interblock information in a balanced incomplete block 
design. Suppose it is desired to estimate the difference of 
two treatment effects. The best linear combination fl of 
intra-block and inter-block estimates is known (e.g. 
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a random variable, uniformly distributed over (0, 1), to 
the observed discrete variable). 

These systems are compared, and the percentage de- 
creases in length of interval achieved by using 84 instead 
of 8, is shown to be appreciable for at least small values 
of c. Expected lengths of 90 per cent. confidence intervals 
for 0SmS14 were calculated and, although those of 8; 
were the smallest in the range, those of 8, exceeded them 
by 0:06 at most for m=6 and were shorter than those of 
8, by 0:45-0-9 over the full range. Comparing 5,, 8; and 
64 with various large sample approximations shows that 
it is a useful and valid procedure to employ 8, intervals 
as far as they are tabulated together with a given large 
sample approximation for larger c due to Hald [Statistical 
Theory with Engineering Applications (1952) New York: 
Wiley]. 

Thus the 6, system is believed to be preferable to 8, 
in many applications, though the author does not propose 
its use for one-sided confidence intervals since division of 
probabilities between the tails of the distribution is not 
restricted. 


(D. Harris) 


4.1 (7.3) 


Kempthorne, Design and Analysis of Experiments, p. 535). 
From the theorem it follows that fi is uniformly better than 
either individual estimate if (i) rt —b -t+1 218 and b—t = 9 
or if (Gi) b—t210 where 7 = no. of replications of each 
treatment, b= no. of blocks and t = no. of treatments. 
So if (i) or (ii) is true, inter-block information should always 
be used. It should be noted that Yate’s method of com- 
bining inter-block and intra-block estimates is different. 
This method is discussed by Graybill & Weeks [Ann. 
Math. Statist. (1959) 30, 799-805]. 

The inaccuracy due to estimating the weights in the 
balanced incomplete block is also studied by comparing 
the variance of fi with that of the estimator appropriate 
when the ratio 07/02 is known. 


(H. A. David) 
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GUTTMAN, I. (McGill University, Montreal) 


Optimum tolerance regions and power when sampling from some non-normal universes—JIn English 


4.5 (2.5) 


Ann. Math. Statist. (1959) 30, 926-938 (3 refs., 8 tables) 


This paper presents optimum f-expectation tolerance 
regions for the single and double exponential distributions 
and exhibits their corresponding power functions. A 
tolerance region is a f-expectation tolerance region if the 
average probability content of the region is at most 8. 
It is a similar B-expectation tolerance region if the average 
probability content of the region is 8. (These concepts are 
defined and optimum f-expectation tolerance regions and 
their power functions are developed for k-variate normal 
distributions in two of the given references.) 

For the single exponential distribution with location 
parameter « and scale parameter a, similar f-expectation 
tolerance regions which are minimax and most stringent, 
and which have the ability to pick up 100 8 per cent. sets 
on the right-hand tail of the distribution, are obtained for 
three cases: 


@) » known (= p,), ¢ unknown, 
Gi) » unknown, oc known (= a,), and 
Gi) » and o unknown. 


The £-expectation tolerance region for the double exponential 
distribution with » known (= po) and which is constructed 
so that it has the ability to pick up the centre 100 f per cent. 
part of the distribution is also found. These four regions 
are [po tagx, ©), [xG)—bgoo, ©), [xa)—Cgs, 90) and 
[Ho—dgt, potdgt], respectively, where ¥ =n Z?_1x, 
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may = thing_;%, s=(—1)4*27_,@:—x%q)) and t= 
21 |%;—po|and where x is the sample size. The quantities 
%g, bg, cg and dy are tabulated for n = 1(1)20, 30, 40 and 
60 and for 8B = 0-99, 0-95, 0:90, and 0:75. The part of the 
distribution picked up by these tolerance regions is dependent 
on the chosen “‘ measure of desirability ”’. 

For the first three tolerance regions presented, the 
measure of desirability employed is a single exponential 
distribution with larger scale parameter, and for the last 
region, a double exponential distribution with smaller scale 
parameter is used. Expressions are presented for the power 
(degree of confidence) that the f-expectation tolerance 
regions cover the specified 100 B per cent. parts of the 
distributions (when the desirability of covering these 
parts is specified). Actual values of the power are tabulated 
for each of the four cases for a measure of desirability of 
0-995 and for m = 1, 3,5, 7, 10, 15, 30 and 60 and B = 0-99, 
0-95, 0-90 and 0-75. 

The distributions that are necessary for the evaluation 
of %g, bg, cg, and dg are derived in an appendix. 


(J. W. Wilkinson) 


4.1 (6.1) 


The extension of Wald’s method of fitting straight lines to multiple regression—In English 
Rev. Int. Statist. Inst. (1958) 26, 37-47 (7 refs., 7 figs.) 


In 1940 the late A. Wald [Ann. Math. Statist. (1940) 11, 
284-300] proposed a simple method for the estimation of 
a regression coefficient in the case of one explanatory 
variable. A first extension to multiple regression analysis 
was given by Gibson & Jowett [Appl. Statist. (1957) 6]. 

The present authors propose an alternative extension 
to multiple regression analysis which can be described as 
follows. Just as in Wald’s method, a straight line in a 
two-dimensional plane is determined by two centres of 
gravity of subsets of the material, so in the present method 
a plane in a three-dimensional space is determined by 
three centres of gravity. These are computed from three 
groups of observations such that no observation is contained 
in more than one group. The problem of how these groups 
should be chosen is solved in an optimal way, viz., such 
that the sampling variance of the estimator is a minimum. 
This is done under two alternative assumptions concerning 
the simultaneous parent distribution of the two explanatory 
variables. In both cases the pairs of observations on these 
variables are assumed to be uniformly distributed, viz., in 
the first case over an equilateral triangle, in the second case 
over a square. | 
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The efficiency of this estimator is computed under both 
assumptions and compared with that of the Gibson-Jowett 
method, which appears to be superior in this respect. As 
to the computational effort, however, the present method is 
simpler. 


(T. Kloek) 
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JILEK, M. & LIKAR, O. (Central Res. Inst. of Food Ind. and Res. Inst. for Food Ind. Econ.) 


4.5 (3.2) 


Coefficients for the determination of one-sided tolerance limits of normal distribution — 


In English 


Ann. Inst. Statist. Math., Tokyo (1959) 11, 45-48 (2 refs., 1 table) 


A table of coefficients k is given so that the interval (—«, 
*%-+ks) or (*—ks, +00) covers with the probability p at 
least the ratio « of the normal population with unknown 
parameters (mean, » and standard deviation, c); where « 
is the sample mean and s is the unbiased estimate of the 
standard deviation. The authors use the fact that k has 
the form k = (1//n)t(n--1, U,/n, 1—p), where t(f, 8, «) is 
the 100« percentage point of the non-central t-distribution 
with f degrees of freedom and with the patameter of non- 
centrality 5. U, is the 100(1—«) percentage point of the 
normal distribution N(0, 1). The values of coefficients k 
are tabulated for m = 5(1) 20(5) 50(10) 100(100) 300, 500 
and 1000, for p = 0-90, 0-95, 0:99 and for « = 0-90, 0-95, 
0-99, 

The method of calculation is essentially an iteration 
method on the basis of the tables of non-central t-distribution 
which were proposed by Johnson and Welch (1939) and 
Janko (1958). The first approximate of value t is calculated 
in accordance with the approximate formula for large sample. 
An auxiliary value A, is determined from the non-central 
t-distribution tables. "The second approximate value ¢, is 
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KULLDORF, G. (Department of Statistics, University of Lund) 


calculated by substituting A, in place of U, of the asymptotic 
formula. The iteration is stopped when two successive 
approximations give the same value. ‘The authors state 
that in about one-third of the cases agreement was reached 
in the second approximation. 


(KK. Isii) 


4.3 (2.8) 


A problem of maximum likelihood estimation from a grouped sample—In English 


Metrika (1959) 2, 94-99 (4 refs.) 


Two random variables X and Y have the distribution 
functions F(x) and G(x), and the relation F(x) = 5G(x«) 
holds for all x<x’. The paper is concerned with the 
estimation of 6 when G(x) is known either completely or 
with the exception of p parameters. Suppose that 7 in- 
dependent observations about X are made which are grouped 
into k specified intervals and let x )(= — ©), x, ..., X:(= %) 
denote the ordered group limits. It is assumed that 
X,_1 = x’, i.e. only in the last interval the relation does 
not hold. 

When G(x) is completely specified, it is a straight- 
forward proof that the maximum-likelihood-estimate 1s 
§ = (1 —n;/n)/G(x;,_1), where n, denotes the number of 
observations in the last interval. Z 

The variance of maximum likelihood estimate 0 is also 
given and it is stated that m, is a sufficient statistic for 6 
and complete, and that 0 is the uniformly minimum variance 
unbiased estimate of 96. 

In the other case, where G(x) is known—except the p 
parameters, it is shown that the maximum likelihood 
estimator of the parameter in this case is the same as the 
maximum likelihood estimator of the same parameters 
when a truncated sample of n—n, observations about the 
random variable Y is made and the truncation point is 
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x = %,_,;. Here it is possible to use the known results 
about truncation for the estimation of the parameter. Then 
the maximum likelihood estimate of 5 is the same as in the 
first case using in G(x) the maximum likelihood estimate 
estimates for unknown parameters. 

This estimation problem has occurred in a study of 
human migration [Kulldorf, Migration Probabilities (1955) 
Lund] where G(x) was represented by the exponential, the 
Pareto and the logarithmic-normal distributions. 


(E. Walter) 
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LEVERT, C. (Royal Meteor. Inst., de Bilt, Netherlands) 


4.5 (11.3) 


A nomogram for confidence intervals and exceedance probabilities—In Dutch 
Statist. Neerlandica (1959) 13, 3-14 (5 refs., 1 table, 3 figs.) 


This paper deals with two problems regarding the probability 
8, that an observation of a normally (pu, o)-distributed 
random variable exceeds a given value W. If the true 
mean value pw and the true variance co? are not known 
(whereas the sample furnishes estimate # and s?), these 
problems are as follows: 


(i) If W is given, to determine a confidence interval for 


> 
(ii) If B is given, to determine a confidence interval for 
W, both intervals being based on a significance 
level «. 


The problems are solved by means of an approximation 
for the exceedence probability of a random variable which 
follows the so called non-central t-distribution. This 
approximation enables us to construct a nomogram. 

After having treated the problems theoretically, the 
author explains the construction of the nomogram. Next 
the application is illustrated by some numerical examples 
from the field of climatology; for example, with regard 
to the probability of a monthly quantity of precipitation 
surpassing the maximum ever measured and the reliability 
of such a probability. 
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MALLOWS, C. L. (University College, London) 
The information in an experiment—IJn English 
J. R. Statist. Soc. B (1959) 21, 67-72 (8 refs.) 


Consider an experimenter A and experiments designed to 
increase A’s knowledge about a parameter 0, an experiment 
E being taken to mean the observation of a random variable 
x whose distribution is assumed known (to A) except for 
the value of 6. It is attempted to construct a numerical 
measure of the information obtained from FH, with the 
object of comparing different experiments. 

It is assumed that (1) A’s beliefs at any stage can be 
completely described by the specification of a distribution 
of @ (his belief distribution b(6)); (2) A proceeds by Bayes’ 
theorem; (3) A’s knowledge is an (unknown) function of 
his belief distribution and the true value of 9; (4) a numerical 
measure W of A’s knowledge exists, this is called A’s 
information. It follows that the average expected increase I 
in A’s information can be regarded as the decrement in a 
risk function, and is additive for successive experiments. 

It is further assumed that (5) the change in information 
is invariant under transformations of 6, and (6) the average 
change is non-negative. It is thought that there is essentially 
only one function W satisfying these assumptions, namely 
W(0(.), 9) = log b(8) (a footnote is incorrect). The resulting 
quantity I can be identified with Good’s “ average weight 
of evidence ”’ for the true‘value of 6, and is closely related 
to the measures proposed by Fisher, Kullback and Leibler, 


and Lindley. 
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In the final section the accuracy of the nomogram is 
studied by means of a comparison of figures read from the 
nomogram with the results furnished by the approximate 
formulas or given by the tables of the non-central t-distribu- 
tion, published by Resnikoff and Liebermann [(1947) 
Stanford University Press]. The nomogram for three 
levels: « = 0-005, 0-025, and 0-05 are given. The curves 
are drawn for sample sizes of N = 6, 8, 10, 20, 30, 40, 50, 
100, 500, 1000, 10,000 and oo. The size of the nomogram 
is 17 cm. by 27 cm. 


(C. Levert) 


4.1 (4.7) 


A comparison of experiments is defined; it is shown 
that “ Fy is sufficient for E,’’ (in Blackwell’s sense) implies 
“EF, is uniformly more informative than E,”’ (in the present 
sense), and that this in turn implies “ EZ, is more informative 
than £,”’ (in Lindley’s sense); but neither implication can 
be reversed in general. 

A sufficient statistic is “‘ uniformly fully informative ’’; 
for each 6 and 6(@) the likelihood ratio statistic is ‘‘ locally 
fully informative’”’. Asymptotically unbiased, the normal 
estimators are ordered according to their variances. It is 
shown that the stopping rules for sequential binomial 
sampling which are obtained by keeping the maximum 
increase in information constant, are very similar to Lindley’s. 

The author distinguishes between the experimental 
situation and that arising in communication theory; it is 
claimed that here it is not appropriate to average over all 
possible values of 0, and that the quantity I is a satisfactory 
measure of “‘ value of information ”’. 


« 


(C. L. Mallows) 
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MIHOC, G. (Bucharest University) 4.3 (8.1) 
A method for determining the sample precision—IJn Rumanian 
Revista de Statistica (1959) 5, 11-23 


This paper is the sequel to an earlier one entitled ‘The 
sampling method ” [Revista de Statistica (1957), 12, 21-35] 
dealing with the precision of the results in a sample, by 
making use of the Boole formula. 

The paper is divided into two parts. In the first one 
a population A with two characteristics (a, b) is investigated 
using simple samples. The estimation of the precision of 
a/b is determined by means of the (estimated) precisions of 
the characteristics a and 6. The method developed in this 
paper consists of determining the approximation relations 
(by excess and by loss) of a/b with the aid of the sample 
means and of the triple of the sample variances corres- 
ponding to a and 6. Generally, this method yields a lower 
precision than that obtained by classical methods; never- 
theless, this method is very simple and does not need so 
many calculations. When the characteristics a and b have 
a correlation coefficient near to —1, the method given here 
yields a precision near to that obtained by classical methods. 
Furthermore, by making use of the same method the sample 
precision of a—2b/a+2b is determined. 

In the second part of this paper the sample precision in 
stratified samples is estimated. The paper contains many 
interesting and instructive numerical examples illustrating 
this method in comparison with the classical ones. 


(G. Simboan) 
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STEIN, C. (Stanford University, California) 4.4 (4.2) | 
An example of wide discrepancy between fiducial and confidence intervals—IJn English | 
Ann. Math. Statist. (1959) 30, 877-880 (8 refs.) 


: 
This paper considers the following situation: ‘“‘ Let X,, 
X», ..., Xn be independently normally distributed real 
random variables with unknown means &,, &o, .... €n and | 
unit variance. We wish to construct fiducial or confidence 
sets for 5 €%.”” These sets are constructed and it is shown 
that, for a large range of values of 2 e a fiducial interval 

for this parameter having a given fiducial probability has 

the property that it is practically certain the interval will 

not cover the true parameter value. The sense of this is 

that as n—>0o with 1/n? X 7-0, the probability that 2 é 
is covered by the fiducial interval tends to zero. 

The author concludes with several questions and com- 
ments which are aimed at the philosophy of fiducial prob- 
ability statements. He takes the position that it is relevant 
to consider the probability of an interval estimator covering 
the true parameter value, even though this position is 
opposed by many practitioners of fiducial inference. 


(D. H. Shaffer) 
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TRYBULA, S. 


(Math. Inst., Polish Academy of Sciences, Wroclaw) 


4.0 (1.3) 


The estimation of frequency in a population of elements belonging to classes not 


represented in the sample—Jn Polish 
Zastosowania Mat. (1959) 4, 244-248 (1 ref.) 


Consider a population divided into a finite or denumerable 
number of classes. Let p; be the frequency of the ith class 
in the population. Denote by p,,, and »,,, the total 
frequency of classes represented 7 times in a sample of size 
n taken according to a Bernoulli scheme and the number of 
such classes, respectively. It is proved that the random 
variable p,, , and the random variable 


o™ tg i . m lmt+-n+1 
rn r) poo \ RB) Rt t+Lntm+1 |R ferret 
(m= 051, 2; ---)s 
have the same expected value. Owing to this it is proposed 


to use the variables 2° as estimators of the total frequency 
of classes not represented in a sample. 


(S. Zubrzycki) 
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WIJSMAN, R. A. (University of Illinois, Urbana, U.S.A.) 


4.2 (4.3) 


On the theory of best asymptotically normal estimates—In English 


Ann. Math. Statist. (1959) 30, 185-191 (8 refs.) 


The author uses a definition of best asymptotically normal 
estimates which requires less stringent assumptions, while 
retaining all the important theorems in the usual theory of 
best asymptotically normal estimates. 

Let Z,, be a sequence of k-dimensional random vectors. 
The distribution of the Z, depends on an m-dimensional 
parameter vector 0, mk. The true value of @ is 0). It 
is assumed that the law »/Zn[,—((@o)] converges to the 
normal multivariate law, N[0, X(@))] in which X(@,) is a 
k by k positive semi-definite matrix. @ is estimated by 
OZ.) =O 

The usual best asymptotically normal estimation theory 
assumes that both ¢ and ¥ have continuous second derivatives 
whereas the author’s definition merely assumes & is con- 
tinuous and ¢ has a continuous first derivative, V; the 
other assumptions are unchanged, viz. ¢ is one-to-one and 
bicontinuous, £(9) is non-singular for every @ and the 
matrix 0/06 is of rank m for every 0. 0 is called regular (1) 
under the usual assumptions if 


(i) 6 is a consistent estimator of 0; 
(ii) 06/ éz exists and is continuous in the neighbourhood 
of ¢. 
A regular (1) estimate with minimal covariance matrix is 


called best asymptotically normal (1). The author points 
out that in order to generate best asymptotically normal (1) 
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estimates by minimising a quadratic form, or from the root 
of a linear equation, it is practically necessary to require 
that X is continuously differentiable and ¢ is continuously 
twice differentiable. 

0 will be called regular (2) under the new assumptions 
if (i) @ is continuous in each point of €; (ii) for each 0, 
there exists an mXk matrix A(Q), continuous in 0, such 
that /n[0,,—0,] converges in probability to 


A(>) Vn Zn — £(8o)], 


for all 0). A regular (2) estimate with minimum covariance 
matrix is called best asymptotically normal (2). ‘The class of 
regular (2) estimates contains both the regular (1) estimates 
and the estimates defined by Ferguson [Ann. Math. Statist. 
(1958) 29, 1046-1062]. Furthermore, the asymptotic 
properties of best asymptotically normal (2) estimates are 
the same as those of best asymptotically normal (1) whenever 
the latter exist. Thus the class of estimates considered is 
enlarged and if best asymptotically normal (1) estimate exists, 
it still belongs to the best estimate of this class. It is proved 
that A(0)V(0) =I, identically in 9. This theorem ensures 


the existence of the minimal covariance matrix and shows | 
that the class of covariance matrices of regular (1) and — 


regular (2) estimates coincide. The author states that with 
a few modifications the whole theory of best asymptotically 
normal (1) estimates holds for best asymptotically normal 
(2) estimates. 


(M. V. Menon) 
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_ BODMER, W. F. (University of Cambridge) 


5.5 (6.9) 


A significantly extreme deviate in data with a non-significant heterogeneity chi-square—In English 


Biometrics (1959) 15, 538-542 (10 refs., 1 table) 


The chi-square test for heterogeneity may have little power 
for a particular alternative. An approximate test is con- 
sidered for the alternative that one of the expected cell 
frequencies exceeds (or is less than) all the others, which 
are assumed equal. 

For a set of k binomial frequencies p;, each of the 
frequencies is transformed into an approximate normal 
variate. ‘Then Nair’s test for outliers is recommended to 
perform the desired test on the expected cell frequencies. 
A biological example is given.in which the over-all chi- 
square is not significant, but the extreme observed cell 
frequency is significant using this method. 


(C. P. Quesenberry) 
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CHANDA, K. C. (Bombay University) 


5.0 (—.-) 


On some simplification in the construction of similar regions—IJn English 


Bull. Calcutta Statist. Ass. (1959) 8, 159-161 (1 ref.) 


If 4 denotes the likelihood based on n observations of p+k 


‘parameters 6,, 9, ..., 94x, then to test the composite null 


hypothesis Hy : 0; = 0%, p+1S7Sp+k, against the com- 
posite alternative hypothesis H : 0; 4 0$, p+1S5iSp+, it 
is known that under the conditions given by Neyman [ Ann. 
Math. Statist. (1941) 12, 46] on 


5 log ¢ . 
a he = am 1 2) sesy 
fo (¢ | H,), Ps 50; $= ( 9 9 D) 
82 log $ ie 
es ie ae! 


_ the complete class of similar regions w of size « can be 


constructed. The most powerful similar region is such that 
inside w,(%) : 6 2Adbo 
where w,(y) is the part of wy inside the hypersurface $; = 
const. (i = 1, 2, ..., p) and A is suitably chosen. 
By setting ¢,,, = log $—log ¢o, and suitably trans- 


forming the variables 4, ..., p41, the author shows that 
under some conditions the most powerful similar region wo 
is such that 

inside wo : Apy1 =e (or Sc’) 


. Rie 
where ),, ..., A» ;1 are the new variables and c or ¢ is chosen 


suitably. 
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The method simplifies considerably the construction of 
similar regions. It is illustrated by applying the technique 
for testing the equality of means of k normal populations 
with equal variances against the most general alternatives. 
The author believes that the technique can be applied to 
more complicated cases with a considerable increase in ease. 


(T. V. Hanumantha Rao) 


GEISSER, S. 


(Nat. Inst. of Mental Health, Bethesda, Maryland) 


5.8 (7.9) 


A method for testing treatment effects in the presence of learning—In English 


Biometrics (1959) 15, 389-395 (6 refs.) 


The usual test for equality of effects of p different drugs is a 
univariate analysis of variance. A sufficient number of 
experimental units (subjects) each receives one of the p 
drugs as a treatment and the analysis is performed on the 
resulting data. ‘The author presents a multivariate alter- 
native to this test when the experimental units are scarce. 

The procedure is to apply all treatments to each experi- 
mental unit in order to obtain a reasonable number of 
observations. ‘The dependence between observations on the 
same subject violates the assumptions of the univariate 
analysis of variance and requires a multivariate approach. 
The observations on each individual are treated as a single 
vector observation. - 

To prevent confounding of the treatment (drug) effects 
with the time (order of application of drugs) effects a latin 
square design is used. ‘The multivariate T? statistic is used 
to test the hypothesis that all drugs have the same effect. 

The assumptions for this test are that each observation 

is a linear combination of the appropriate treatment effect, 

a time (learning) effect due to the order in which the drug 
is applied and an error term. It is further assumed that 
the vector of observations for the jth subject is distributed 
with mean vector p,; and variance-covariance matrix identical 
for each subject. 
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ISHII, G. (Dept. of Statistics, Atomic Bomb Commission, Hiroshima) 


On the exact probabilities of Rényi’s tests—In English 


The author suggests a Youden square arrangement to 
handle the problem of missing data and considers two 
possible alternative models, a multiplicative model and a 
partially multiplicative, partially additive model. ‘The author 
also states that the variance-covariance matrices V; of the 
p groups are identical (V; = V) under the assumption of a 
linear model and unequal (V; ~ V) if the true model is 
one of the above alternatives. Hence, in order to test the 
additivity of the model versus these alternatives an equivalent 
test of the homogeneity of the variance-covariance matrices 
V; may be used. The author refers the reader to such a test 
and suggests that the J? statistic might still be used if one 
of the alternative models is accepted. This leads to an 
approximate x? test. 


(H. O. Posten) 


5.6 (3.8) 


Ann. Inst. Statist. Math., Tokyo (1959) 11, 17-24 (9 refs.) 


In this paper two exact probabilities are obtained by the 
‘ author, 


(i) Pr{ Sup [F(x) —F,,(«)/1—F(n)]< «} and 
b= F(a) 
(ii) Pr{ Sup [F,(«) —F(x)/1—F(x)]< 6}, where F(x) is a 
b> F(a) 


continuous type distribution function and F’,(x) is 
an empirical distribution function based on a sample 
of size from the distribution F(x). 


Formerly Rényi (1953) treated the statistics 
Sup [F,(x)—F(x)/F(n)] and Sup [F(x)—F,(«)/1—F(x)], 
as F(a) 1—a> F(a) 
obtained their limiting distributions when 7 tends to infinity, 
and showed that these limiting distributions are distribution- 
free. The treatment by the present author differs from 
that of Rényi in the method employed: in particular, he 
considers the case where 7 is finite. The results show that, 
as in Rényi’s case, the exact probabilities are also distribution- 
free; that is, independent of the form of F(x). 

The author states that these results will be useful for a 
test of goodness-of-fit in life testing. [See also: Ishii “ Test 
of fit in life testing ” Ann. Inst. Statist. Math., Tokyo (1957) 
9, 117-125 and “ Kolmogoroff-Smirnoff test in life test”’ 
Ann. Inst. Statist. Math., Tokyo (1958) 10, 37-45]. 

The method of his proof can be summarised as follows. 
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Let the ordered sample of values from F(x) be x1,x2...Xn, 

and X a random variable following F(x). Transform x; by 

F(x), and put U,; = F(x,). Then, U; is a realised value 

of the random variable U; = F(X) which follows the 

uniform distribution over the interval [0, 1]. By this trans- 

formation, the relation Sup [F(X)—F,(X)/1—F(X)]<e« 
b> F(a) 


is changed into Sup [1—G,(U)/1—U]<«, where G,(U,) = 
b=U 
(¢—1)/n. 


The latter relation is equivalent to the following (R) on 


U,, Uz, ..., Un under the condition that U;,<b< Um_1. 
(R) U;<[G@—-1)/n+eJ/1+e -iftSk+1 
U,;<6b if R+1<iSm. 


where & is an integer such that {k/n+c¢}/1+.eSb<{(Rk+1)/ 
n+e}/1+e. For a fixed number 0(0<6S1), a suffix m with 
Um<b<Um.1 is also a random variable which follows the 
binomial distribution B(b, n). The joint distribution of 
U,, .2,.U,,, and m isn! /(n—m)!(1—b)"-"du,,; dus, ..., Dian. 
By integrating this under the above mentioned restriction 
(R) and summing them up for all m with n=mzk, the 
author obtains his results. Concerning the above integration, 
several lemmas are stated and the main lemma is proved. 

The exact value of the other probability (ii) is obtained 
analogously. 


(Y. Suzuki) 


——e eee ee | | | et or ne ree rarer Sere ayy 


KAPUR, M. N. 


(Indian Council of Agric. Res., New Delhi) 


5.4 (-.-) 


A property of the optimum solution suggested by Paulson for the k-sample slippage problem 


for the normal distribution—IJn English 


J. Indian Soc. Agric. Statist. (1957) 9, 179-190 (1 ref.) 


In 1952, Paulson suggested [Ann. Math. Statist. 23, 610-616] 
a test criterion for testing the hypothesis that k given normal 
populations are identical against alternatives under which 
the mean of one is greater than the mean of all the others. 
If x1, X%2, .... X, are samples of size one from each of the k 
populations, and if S? is an independent estimate with n 
degrees of freedom of their common variance, the procedure 
suggested is to take the decision D, that the ith population 
has the greatest mean if 


Cy] nS" H+) 2 hb oe G=1, 8) 


and to take the decision D, that all populations have the 
same mean when 


(24,8) [Pest +or44) 2 wal '<e 


where x,, = Max(x, ..., Xx), % = (x1 +...+%,)/R. 


Paulson showed that this procedure was unbiased. In 
this paper the author proves that the test procedure is 
unbiased even when the class of alternatives is the wider 
class of k normal populations with equal variances but 
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KONIJN, H. S. (University of Brame: Australia) 


which contains at least one pair with unequal means. To 
be more precise, the author proves the following :— 


G@) P[D) | Wy]2ZP[Do | w] 
Gi) PLD; | wi) P LD; |w,) 4; 7 = 1, 2,...., & CHF) 
Git) -P[ D7 |-2,) 2 PD | wolk@=1, 25... R). 
where w,(z = 1, 2, ..., k) is that portion of the parameter 
space where the zth population has the greatest mean. 


(S. John) 


5.6 (5.3) 


Some nonparametric tests for treatment differences in paired replications—In English 


Y. Indian Soc. Agric. Statist. (1957), 9, 145-167 (50 refs.) 


Consider n pairs of random variables (W,, V;)(i = 1, 2, ..., n) 
and the hypothesis H, that for each 7 the distribution of 
Z; = W;—V; is symmetrical about zero against the alter- 
native that it is symmetrical about some 0; #4 0.. For 
example W; and V; might be the responses of the treated 


and untreated members in the 7th pair of experimental units 


and H, the hypothesis of ‘no treatment effect”. The 
alternative envisages a situation where the treatment effects 
the ith pair only in so far as it shifts the response of its 
treated member by 0;. 

Let ‘z, be the observed values of Z;. Under Ho, the 


~ number m of positive z’s obeys a binomial distribution with 


parameters 3 and n and the division of the observed differ- 
ences into those with positive and negative signs constitutes 
a random partitioning of | z | *s, It has therefore been 
pointed out that any two-sample test which is symmetric 
in the observations, can be used to construct tests for the 
symmetry hypothesis. The Fisher-Pitman randomisation 
analogue of the paired t test can be (trivially) seen to be a 
test of this type, and in fact Wilcoxon’s matched pair signed 
rank test is derived thus from the two-sample rank sum test. 
One might similarly extend the Median test or the Mann- 
Whitney test. 

In this paper Konijn studies the Pitman asymptotic 
power of all these tests and also the sign test which has no 
counterpart in the two sample theory. In deriving his 
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expressions for the local asymptotic power, Konijn assumes 
that the shift 0; is the same (numerically) in all the pairs 
and that the different pairs of random variables are in- 
dependently distributed. The following comments are 
made: 

(i) Sign test. In general the efficiency of the sign test 
is highly dependent on the local properties of the density 
of the distribution and has a greatest lower bound of zero, 
so that the sign test is not in general to be recommended 
as a symmetry test against shift alternatives. In the normal 
case (that is when the Z,’s are normally distributed with the 
same variance) the sign test has a Pitman local asymptotic 
efficiency of 2/7 = 0-637 (approx.) with respect to the 
randomisation test. If the different pairs are identically 
distributed and observations within pairs are independent, 
the local asymptotic efficiency has a minimum of 0:576. 

(ii) Rank sum and Mann Whitney test. In the normal 
case, the local asymptotic efficiency with respect to the 
randomisation test are 3/7 = 0:995 (approx.) for the rank 
sum test and 0:32 for the Mann-Whitney test. 

(iii) Median test. In the normal case it has a local 
asymptotic efficiency of 0-81 with respect to the randomisa- 
tion test. In general, however, its efficiency, like that of the 
sign test, has the undesirable property of depending strongly 
on the local properties of the density. 


(S. K. Mitra) 


tt 


KONIJN, H. S. (University of Sydney, Australia) 5.2 (6.0) 
Positive and negative dependence of two random variables—In English 
Sankhya (1959) 21, 269-280 (5 refs.) 
Given the marginal distributions G and H of two random ‘The author derives classes within which the first four 


variables X, Y, Frechet [Amn. de l Univ. Lyon (1951), 53-77] 
proved the existence of a joint distribution function F,(F_) 
which is the uniformly highest (lowest) valued among all 
‘possible joint distribution functions of X, Y and which 


statistics yield consistent tests of independence. 

P, T1, T2 and 73 are computed for positively and negatively 
k-dependent random variables. Further, it is shown that 
the tests based on unbiased estimates of the 7 are unbiased. 


Or) en A) Re en ee EEE 


implies an almost sure non-decreasing (non-increasing) 
relation between X and Y. ‘The random variables are 
defined to be completely positive (negative) dependent if 
there is an almost sure non-decreasing (non-increasing) 
relation between the two and positively k-dependent if 
their joint distribution can be expressed as (1—A) Fy) +kF,, 
where F, = GH is the joint distribution of the variables if 
they are independent. Negative k-dependence is similarly 
defined. This paper discusses the properties of such 
variables and the properties of various tests for independence 
against such types of dependence. 

The test statistics considered in this paper are the 
estimates of 


(i) Correlation coefficient p, 

(ii) Spearman’s rank correlation 79, 
(iii) Difference sign correlation 7, 
(iv) Grade correlation 72, 

(v) Medial correlation 73. 
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KUIPER, N. H. (Landbouwhogeschool, Wageningen) 
Tests concerning random points on a circle—In English 
Proc. Kon. Ned. Akad., Wetensch. A (1960) 63. 


Powers of these tests against alternatives in k are obtained 
and also their asymptotic power when k is near zero. ‘The © 
latter turns out to be independent of the marginals. 


(K. Rajaraman) 


5.6 (1.4) 


Indag. Math. (1960) 22, 38-47 (8 refs., 3 tables, 1 fig.) 


Let M = (¢4, ..-) $m) be a sequence of angles (0 <4; < 27) 
representing compass-directions into which m birds have 
been seen flying on migration at a given place and time. 
In this paper tests concerning the null-hypothesis that di, 
..., 6m are independent values of a known random angle ¢ 
(problem 1), and tests for the hypothesis that two sets M 
and M? belong to the same unknown random angle (problem 
2) are studied. 
In the first part of this paper three obvious methods are 
considered: 
(i) In the case where the probability mass is concen- 
trated in*a small ¢-interval, the circle of angles 
(mod 27) can be cut, and methods for ordinary 
random variables applied (problem 2). 
(ii) The interval 0<¢<27 can be divided in sub- 
intervals and the chi-squared goodness-of-fit test 
can be applied (problem 1). 
(iii) The random vector z = (cos 4, sin ¢) can be con- 
sidered as a two dimensional random vector (problem 


1 and 2). If ¢ has a uniform distribution then the 
m 

statistic 2 2; is asymptotically ‘ isomorous eet 
t=1 

x. V (km) where “ isomorous ” stands for “ having 

the same cumulative distribution function’? and 


chi-square is the two-dimensional normal random 
vector with expectation zero and unit covariance 
matrix. Estimation problems are also considered 
here. 


In the second part of the paper, non-parametric tests 
on the circle related to the tests of Kolmogoroff and Smirnoff 
are introduced and studied. A statistic V, and a statistic 
Vn,m are suggested and tables of critical regions are given 
for the V,-test (problem 1) and the V,,,-test (problem 2). 


(N. Kuiper) 


SIOTANI, M. (Inst. Statist. Math., Tokyo) 


The extreme value of the generalised distances of the individual points in the 


multivariate normal sample—In English 


5.5 (5.9) 


Ann. Inst. Statist. Math., Tokyo (1959) 10, 183-208 (12 refs., 3 tables, 2 figs.) 


This paper is concerned with the extreme value of the 
generalised distances from the origin of N individual points 
which may be correlated with each other in a p-variate 
normal sample. Let Y,(a = 1, 2, ..., N) represent N 
p-variate vectors with mean zero and with covariance matrix 
YA(Y>0), where A is a symmetric, positive definite matrix. 


‘Further, let the covariance matrix of Y, and Y,(«a4 8B) be 


SA, where | 8 |< Y, and define the quadratic forms 
Pin =O. aid YL Y, = Ow where L is an unbiased 
estimator of A. 

The statistics treated in this paper are OQna,= max (Q,) 


s be gh a 
and Qmax= max(Q,). The extreme deviate from the 
(e4 


population mean or from the sample mean in the sample 
of size N drawn from the multivariate distribution, the 
extreme deviate from the control variate and the range in 
the multivariate case are considered as special cases of Omax 
and Omax- 

The main problem discussed in this paper is the obtaining 
of asymptotic formulas by which good approximate upper 
percentage points of Qmax and Qmax can be calculated when 
Y has the p-variate normal distribution. For this, 
Pr[Q, >a®, Og >a?] and P1[O, >B,?, Og >B,*), where «#8, 
are evaluated. ‘The former is calculated from the bivariate 
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STERLING, T. D. (University of Cincinnati) 


chi-square distribution and the latter is asymptotically 
evaluated in terms of v~1, where v is the number of degrees 
of freedom of L, by applying a slightly modified method 
by James [Biometrika (1954) 41, 19-43]. The extreme 
deviate of N individual points from the sample mean is 
given in some detail. ‘Tables of the upper 5, 2:5 and 1 per- 
centage points are given for p = 2, 3, 4; N = 3(1)10(2)20, 
25, 30 when A is known and for p = 2; N=3(1)12, 14; 
v = 20(2)40(5)60, 100, 150, 200 when A is unknown. 
Finally, the author comments on the evaluation of upper 
percentage points of the square of the range. 


(Y. Suzuki) 


5.0 (11.0) 


Publication decisions and their possible effects on inferences drawn from tests of significance— 


or vice versa—In English 


¥. Amer. Statist. Ass. (1959) 54, 30-34 (8 refs., 2 tables) 


‘In this paper the author discusses three theses: 


(i) Experimental results will be printed with a greater 
probability if the relevant test of significance rejects 
H, for the major hypothesis with Pr(E | Hy) <0-05 
than if they fail to reject H, at that level. 

(ii) The probability that an experimental design will 
be replicated becomes very small once such an 
experiment appears in print. 

(iii) A great many more experiments are performed 
than appear in the pages of professional journals. 


These theses are partially supported by a survey of 
362 recent research reports in psychology which revealed 
that of the 294 that used tests of significance, only 8 failed 
to reject some ‘‘ major” hypothesis (complicated con- 
ventions attach to the word “major’’), and none were 
reports of replications of previously published experiments. 

The preference for publication of significant rather than 
insignificant results may be either official journal policy or 
merely an implicit agreement among authors. In either 
case research leading to non-significant results may be 
repeated by various workers, each unaware of earlier negative 
results, until by chance a significant result occurs, leading 
to publication of an “‘ error of the first kind.” 
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The malpractices are not the private domain of psycho- 
logy but are widespread also in experimental journals in 
biology, chemistry, medicine, physiology, and sociology. 
Some onus appears to be attached to the reporting of negative 
results. 


(R. J. Buehler) 


first and second alternative, respectively. 


STEYN, H. S. (University of South Africa, Pretoria) 


5.2 (2.4) 


On x’-tests for contingency tables of negative multinomial type—In English 


Statist. Neerlandica (1959) 13, 433-444. 


Usually, when in a contingency table a y?-test is used to 
test independence between the classifications, the total size 
of the sample is constant. The author treats the case where 
the number of observations in one of the cells is fixed and 
consequently the total size of the sample is a random variable. 
This situation occurs, for instance, when one expects one 
cell to have a very smail probability and decides to continue 
sampling until a given number of observations in this cell 
is reached. Instead of the multinomial distribution con- 
nected with the ordinary contingency table the distribution 
of cell frequencies is now negative multinomial [a generalisa- 
tion of the one-dimensional Pascal distribution, see: Steyn, 
Proc. Kon. Ned. Akad., Wetensch. A (1951) 54 and (1955) 58). 

Let us in a h by k contingency table denote by p,; the 
probability of an observation to fall in cell c;;, by m the 
fixed number in cell cy) and by f,;; the number of observa- 
tions in cell c,;; According to the negative multinomial 
distribution the expected frequency in cell c,; is mp;;/Poo 
and the expected total number of observations m/po 9. The 
expression x”, formed by summing up (f;; —&f,;)?/E@f;; over 
all cells and subtracting the corresponding term for the 
total number of observations, has (asymptotically for m—>o) 
a x?-distribution with hk—1 degrees of freedom as Steyn 
proved in 1955. If the classifications are independent: 
Di; = Pip; In the present paper two methods are given 
to derive estimates for the unknown parameters p; and p;; 
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UHLMANN, W. (Inst. fiir Versich.-math. und Math. Statist., Hamburg) 


On a nonparametric test by Lehmann—Jn German 
Metrika (1959) 2, 169-185 (6 refs.) 


_ In 1953 [Ann. Math. Stat. 24, 23-43] Lehmann showed that 


the Wilcoxon (Mann-Whitney) test is the locally most 
powerful rank test for the hypothesis F(x) = F,(x) against 
the alternative F,(x) = (1—p)F(x)+pF (x), F(x) = F(x). 
In the same paper, Lehmann suggested a two-sided rank 
test which is locally most powerful for the alternative 
“ F(x) = (1—p) F(x) +pF («)®, F(x) = F(x) or Fy(x) = F(x), 
F(x) = (1—p)F(x)+pF(x)?””. The power is defined in 
terms of the “average” power function B(p) = 4$(8,(p) + Bo(d)), 
where f,(p) and £,(p) are the power functions under the 
As p’(0) = 0, 
the locally most powerful test is characterised by f’’(0) = 
Max. The test obtained in this way is, in general, different 
from the two-sided Wilcoxon test. 

This- present paper contains a table for this test with 
critical regions for all levels up to 0-05 for the sample sizes 
4(1)9. Several lemmas of importance in connection with 
the computation of this table are given. It is seen that, 
for sample sizes up to 9 and levels of significance 0:05, 
to each critical region of the two-sided Wilcoxon-test there 
corresponds a critical region of Lehmann’s test at the same 
level. But Lehmann’s test has additive critical regions, so 
that a given level of significance could be approached more 
accurately which might be important in the application of 
non-randomised tests. 


(J. Pfanzag]l) 
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the first using the row and column totals and the second 
using the method of modified minimum x? [see Cramér, 
Mathematical Methods of Statistics (1954)]. In both cases 
the resulting expression after substituting the estimates in 
x has asymptotically a y?-distribution with (h—1)(k—1) 
degrees of freedom and thus can be used to test independence 
between classifications. The proof of this statement is 
indicated. The author especially treats the fourfold or 
2 by 2 table. 


(Gerda Klerk-Grobben) 


5.6 (5.3) 


Remark of the Regional Editor: 

The abstractor is of the opinion that there exists a 
concept of “‘ optimality ”’ (different from the one based on 
the “‘ average’ power function, as used by Lehmann and 
Uhlmann), for which the two-sided Wilcoxon-test turns 
out to be optimal for the problem stated above. A short 
note on this subject will appear in Metrika (1960) 3. 


(J. Pfanzagl) 


Let &, 


VAS, E. (Math. Inst., Hungarian Academy of Sciences, Budapest) 


5.7 (8.7) 


On the efficiency of the sequential probability ratio test—In English 
Publ. Math. Inst. Hung. Acad. Sci. (1959) 4, 171-181 (6 refs., 3 tables) 


The author improves some results of Wald [Sequential 
Analsysis, (1948) New York, Wiley] concerning the average 
sample number of his sequential probability ratio test. In 
the relevant formulas of Wald the following quantities 
appear: 


& = sup M{z—r | 6; z2n, 
r>0 


& = inf M{z+-r | 0; z5—-7}, 
7>0 


fo = sup M{(z—r)? | 0; z=7}, 
r=0 

C, = sup M{(z+r)? hon ae, 
r>0 


where 2 = log { f(x, )/fl«, 4). 

Here f(x, @) denotes the probability density function of the 
observed random variate x, 9 denotes the unknown para- 
meter, 6) and 0, its values for the two hypotheses. Wald 
calculated ¢, and €, for the normal and binomial distribution, 

The author proves that if ¢ has a logarithmically concave 
density function (in the interval of its possible values) then 
the above extreme values appear at r = 0.. This condition 
is satisfied if the parent distribution belongs to several 
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VINCZE, I. (Math. Inst., Hungarian Academy of Sciences, Budapest) 


important distributions, among them being the case of the 
normal parent distribution. For this case the author’s 
result simplifies the calculations and gives better results. 
Tables are given for the approximate values and the lower 
and upper limits of the efficiency (average sample number) 
of the sequential probability ratio test for the case where 
the parent distribution is the normal distribution. 


(K. Sarkadi) 


5.6 (5.3) 


On some joint distributions and joint limiting distributions in the theory of order statistics, [I—Jn English 
Publ. Math. Inst. Hung. Acad. Sci. (1959) 4, 29-47 (3 refs., 1 table) 


..-5 €n and 7, ..., Nn be samples from populations 
having the distribution functions F(x) and G(x). Let us 
denote further by F’,(x) and G,,(x) respectively the empirical 
distribution functions of these samples. 

Let em) in the following denote those smallest values 
of x, for which the function F,,(«+0)—G,(x-+0) takes its 
maximum. 

Let us put further 


—N<7<0 
Re, = UF? +0) +G,(E? +0)] 


In a previous paper [Publ. Math. Inst. Hung. Acad. Sct. 
(1957) 2, 183-209] the author derived the joint distribution 
of the pair of random variables (Dy, i Re ») as determined 
under the assumption F(x) = G(x). This paper derives 
the generating function of the probabilities Pe = P{Di, = 
k/n, Ke n = A/2n} and some of its consequences. 

Using the notation PY = 0, the author states the 
generating function which has the form of a triple sum- 
mation in n, k and A. The marginal distribution of the 


probabilities with regard to k gives the distribution of 
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Gnedenko-Koroljuk. 
identities 


In section 3 a direct proof of the 


() 


=* Pp) = 2k-+1 
ey n+k+1 (Fa 


n 


is given and the conditional expectation of the random 
variable Rj#,, under the condition Din = Rin is also 
determined. ‘The author then introduces the following 
notation: 

Bs 2) po) 
and in the fourth section the following relation is proved 
forgi=="4, 2am 

P Mn. P oH 
A table is given for the probabilities Pa where n = 10. 


(E. Csdki) 


WALSH, J. E. (System Development Corporation, Santa Monica, California) 
Large sample nonparametric rejection of outlying observations—In English 


5.5 (5.6) 


Ann. Inst. Statist. Math., Tokyo (1959) 10, 223-232 (4 refs.) 


The author presents a method of nonparametric rejection 
of outlying observations. When x,S... <x, are independent 
observations from continuous populations, the null hypothesis 
is that these observations are a sample from one and the 
same population. The alternate hypothesis is that the 7 
smallest observations are too small (or that 7 largest observa- 
tions are too large) to be consistent with the null hypothesis. 
However the continuous populations considered here 
are slightly restricted to types in which after the paper 
of David & Johnson [Biometrika (1954) 41, 228-240], 
the mean and variance-covariance expansions, and some 
related ‘Taylor expansions are valid. The test criterion 
is that the alternative is accepted if a statistic of form 
x;—(1 + A)x;,,+Ax;, is negative (or if x,,, ,—-(1+A)xn_~+ 
AxXni1-% iS positive) where A>0, k is the largest integer 
which is less than (¢+ »/2n) and 7 is sufficiently large. 
Consider a statistic of the form 
X = x,—(14+A)x;+Ax,, L<j<k and A>0 

then the problem is to select 7, k, and A for given values 
of z and n, so that a satisfactory nonparametric test is 
obtained. The relation 7 = i+1 is established by requiring 
that {1-+O(1)} Var (X | Hy) be minimum. The values of 
k and A are determined by this requirement and the 
additional conditions that A >0 and under H, 


E(X) = kv/var (X) (1+0(1)}, 
where K is specified. 
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WILLIAMS, E. J. (North Carolina State College, Raleigh) 
The comparison of regression variables—In English | 
F. R. Statist. Soc. B (1959) 21, 396-399 (2 refs.) 


The problem discussed is that for which Hotelling originally 
put forward a test of significance. When a regression 
relationship for predicting the values of a variable is to be 
determined, and two or more regression variables have been 
measured, it is usually thought appropriate to calculate the 
multiple regression in order to obtain the predicted values. 
Sometimes it so happens that a choice of variables to be 
used for the predictor has to be made, and Hotelling’s test 
was concerned with comparing the efficiency of the different 
regression variables. The author discusses Hotelling’s test, 
derives it anew and expresses it as a test of regressions. It 
is pointed out that a test proposed by Healey is different 
from that of Hotelling. 

A sample of n values of each of p regression variables 
X41, Xp) ...) Xp is supposed, and y an independent variable: 
z, is the square root of the sum of squares about the regres- 
sion of y and x;. The test criterion is the sum of squares of 
the deviations of the z; from their weighted means. This is 
tested against the residual mean-square. ‘The author gives 
an analysis of variance table for splitting up the various 
sums of squares. 
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An approximate upper bound for its significance level 
when 7 is not too small, is obtained by the relation 


Pi{x;—(1 +A) x41 + Ax, <0} 
= P{X—&(x)/V var (X)< —K+O(1)} <1/K?+0(1) 


under the null hypothesis. In the derivations, two assump- 
tions are used, the first of which is introduced for deriving 
the above inequality and the second for the sake of mathe- 
matical completeness. 

A two-sided test is obtained as a combination of these 
one-sided tests, and some power function properties of 
them are considered for the case where the observations are 
independent and are from populations with fixed shapes. 


(H. Hudimoto) 


5.8 (6.1) 


The limitations of Hotelling’s test are noted. It is a 
conditional test and is considered not suitable for drawing 
conclusions about the future behaviour of the independent 
variables. The author proposes an approximate test for 
comparing two predictors. 


(Florence N. David) 


AOYAMA, H. (Inst. Statist. Math., Tokyo) 


On the evaluation of the risk index of the railroad crossing—In English 


6.4 (6.9) 


Ann. Inst. Statist. Math., Tokyo (1959) 10, 163-180 (23 tables, 4 figs.) 


The author investigates the traffic volume at railroad 
crossings which belong to the Japanese National Railways, 
and he studies the method of calculating the risk index of 
crossings. 

There are four kinds of crossings classified according 
to their physical equipment. Crossings of 2nd kind are 
not included in this paper, because they are very rare. 
Firstly, twenty-three characteristic factors of the railroad 
crossings are considered. Later, by comparing the distribu- 
tion of the crossings of the 4th kind without an accident 
with that of crossings of the same kind with an accident 
and using the chi-square test, 8 main factors are selected. 
Numerical values to be given to the categories of these 
factors are calculated by maximising each correlation ratio. 
After these operations, the risk index y of any crossing is 
given by the equation with unknown weight w,, 


BY) = Wb =P ns ap hos ARORA 
where x; is the numerical value of the category taken by 
the 7th factor of the crossing. ‘The weight (w,) is calculated 


by the method of linear discriminant functions. ‘The author 
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ARMITAGE, P. (London School of Hygiene) 
Host variability in dilution experiments—In English 
Biometrics (1959) 15, 1-9 (11 refs.) 


Following Moran [¥. Hyg., Camb. (1954) 52, 189-193 and 
’ 444-446] and Armitage & Spicer [¥ Hyg., Camb. (1956) 
54, 401-414] a re-examination is undertaken of the theory 
of experiments in which various concentrations of infectious 
particles are administered to groups of host organisms and 
where the response variable is the proportion p of hosts 
infected at each dose. Interest centres about the density 
function f(p): estimation of parameters of f(p) and testing 
the null hypothesis Hy: p = constant. 

Correction is made of a previous assertion that a 
maximum likelihood approach was inappropriate. It appears 
‘that second derivatives of the likelihood function (given Hy 
true) do converge. Discussion of the maximum likelihood 
approach and of certain tests of H) are given. 

For analytical convenience (although theoretically objec- 
tionable except as an approximation) it is assumed that, 
for a given dose of infectious particles, f(p) has a Type III 
distribution. The likelihood equations and expectations 
of the second derivatives are found. On H, these expecta- 
tions are also given. When the number of hosts are constant 
for each dose and when dilutions are two-fold over a wide 
range, the summations appearing in the expectations on Hy 
are close to the corresponding integrals. These are evaluated 
giving very simple expressions involving a single parameter 
y. The asymptotic test of Ho involving maximum likelihood 
estimates of parameters VY and y and the expectations is 
mentioned. An asymptotically equivalent test on Hy is 


426 


defines the coefficient of accident by the percentage of the 
number of crossing with accidents to that of all crossings 
having the same value of risk index and he gives the relation 
between it and the risk index. The risk index or the co- 
efficient of accident is used as a criterion to alter the kind 
of crossings. 


(Y. Suzuki), 


6.8 (4.2) 


described not requiring evaluation of the maximum likelihood 
estimates. In this test use is made of a consistent but 
inefficient estimate. Fisher’s estimate for the “ dilution 
method ”’, although not strictly consistent, is proposed as 
an alternative. 

The data of Parker treated earlier is analysed: Hy is 
strongly rejected and hence asymptotic equivalence of the 
tests is not attained. An example, artificially obtained, 
with H, nearly true is examined. The maximum likelihood 
test described above is found to be more efficient than 
two quick tests earlier proposed by Armitage & Spicer. 
This is due to the flattening of the response curve occurring 
even with small variability in p. 

With regard to the choice of estimation versus testing 
H,, two situations are considered: (i) the particles are of 
two kinds, infective and non-infective; (ii) some of the 
particles at least may be either infective or inert. In 
situation (i) it is reasonable that Hy should be true; however 
for situation (ii) H,» is inherently implausible and hence 
would never be tested. When the situations cannot be 
distinguished on a priori grounds a test of Hy is tantamount 
to a test of situation (i). Slight departures from situation 
(i) may require modification of f(p). The truncated 
exponential distribution is considered and is found to lead 
to the same test for Hy. It is suggested that the proposed 
test has validity unrestricted by strong assumptions about 
the form of f(p). 

(M. E. Turner) 


ASHFORD, J. R. (National Coal Board, London) 


6.8 (4.3) 


An approach to the analysis of data for semi-quantal responses in biological 


assay—In English 


Biometrics (1959) 15, 573-581 (8 refs., 1 table, 2 figs.) 


In connection with the problem of relating an observed 
quantal response to an underlying quantitative reaction 
Hewlett & Plackett [Biometrics (1956) 12, 72-78] have shown 
that the quantal dosage-response relationship may be 
derived from the corresponding quantitative dosage-response 
relationship. 

The purpose of this paper is to examine the application 
in reverse of some of the results given by Hewlett & Plackett 
in order to determine the effect of errors of measurement 
of both dosage and reaction on the quantal relationship. 
The paper also examines the problem of estimating the 
parameters of the underlying quantitative dosage-reponse 
relationship from the data of a multinomial quantal response. 

The author discusses briefly the corresponding quanti- 
tative and quantal relationships. The effects of errors of 
measurement are discussed in some detail, and it is pointed 
out that observations of the response are likely to be in 
error, resulting in an increase in the variability of the 
quantitative dosage-response relationship. In some cir- 
cumstances the dosage metameter may be subject to errors 
of measurement: this will in turn modify the relationship 
between the observed dosage metameter and the observed 
response metameter. "The means and variances of the 
dosage and response metameters under the condition of 
linear relationship are given. Finally, under the above 
conditions, the author states that the “‘ apparent’’ dosage 
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response relationship is also linear although the slope of the 
regression line is reduced. 

The quantal responses most frequently encountered in 
practice are of the binomial type, but there are some 
applications in which the underlying quantitative response 
scale may be subdivided into a number of ordered and 
mutually exclusive intervals and it is necessary to consider 
a multinomial response. At this point the author demon- 
strates expressions and calculations using maximum likeli- 
hood estimates and shows that the parameters associated 
with the underlying quantitative response may be estimated 
from the semi-quantal response. This statement is valid 
only if it is known which parameters are to be estimated: 
the method of analysis depends on the validity of the large 
number of assumptions which it is necessary to make. 

The practical example used to illustrate the author’s 
procedure was provided by an investigation of the causes 
of pneumoconiosis among coal miners from collieries in 
England. An iterative procedure was employed to solve the 
maximum likelihood equations and estimates of the 
parameters were obtained. 

From the figure showing parallel quantal response lines 
the author concludes that better estimates of these lines 
may be derived from this method of analysis than would 
be obtained by a consideration of each response separately. 


(J. J. Bartko) 


6.0 (8.7) 


Some statistical problems in the construction of consumption scales—In Spanish 


Trab. Estadist. (1959) 10, 63-73 (11 refs.) 


A mainly expository paper on consumption scales and 
related statistical questions. The author considers con- 
sumption scales as a special case of the general problem of 
measuring clusters or composite units when simpler units are 
not clearly defined. An attempt is made to define such 
units in order to measure clusters, trying to make the 
definition practically efficient from the point of view of 
information and cost. 

In the special case of consumption (in economics) 
reference is made to the previous work of Woodbury and 
others. In relation to food consumption and the corres- 
ponding scales based in requirements, the lack of a general 
acceptance of levels as to specific necessities is emphasised. 
For example, as to the requirements of proteins of animal 
and vegetable origin. 

A brief consideration is given to the origin of Engel 
curves as the relation between the expenditure or con- 
sumption of some item and the income or total expenditure 
in the families of a certain population. The work of 
Sydenstricker and King is reviewed as well as their termin- 
ology and concepts based on the use of “‘ ammains 2 RNG 
similar units. 

A more detailed exposition is made of the method due to 
Friedman who takes, as the unit of scale, the married 
couple instead of the adult male because of improved 
information and less irregularity in the distribution of 
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budgets. The Friedman method based on the fitting of 
a line to variables made independent of the number of 
consumption units is discussed—also the advantages of 
considering a continuous variation of consumption with 
the age, within each sex. A disadvantage of this method 
is the use of the same scale in the computation for the item 
considered and for the total expenditure. However, when a 
weighted average of the units for each item is taken as the 
unit for the total, the method loses its simplicity. A different 
approach is used by Houthaker and Prais. ‘They state the 
Engel function in a form in which the independent variable 
is the income (or total expenditure) per unit and the 
dependent one the same as in the method by Friedman. 
Making two different assumptions about the form of the 
function, the values of the units can be computed by a 
successive approximation procedure. A different type of 
function could be used—as pointed by the author who also 
stresses the practical convenience of the method used by 
Brown who makes a previous classification of families into 
types of constant composition. 

Finally, attention is given to the possibilities of extension 
to different fields, such as productivity. 


(S. Rios) 


DALL’AGLIO, G. (University of Rome) 


On the compatibility of two-dimensional distribution functions—Jn Italian 


6.0 (—.—) 


R.C. Mat. Univ. Roma (1959) 18, 385-413 (7 refs., 3 tables, 3 figs.) 


Three two-dimensional distributions, respectively of the 
pairs of variables (x, y), (x, 2), (y, 2), are said to be compatible 
(“ compatibili ’’) if a three-dimensional distribution exists, of 
which they are the two-dimensional marginal distributions. 
This paper studies the conditions for the compatibility, 
related to the distribution functions. 

After giving a necessary condition, the author studies 
the class C of the distribution functions of the couple 
(y, 2) compatible with two given two-dimensional distribu- 
tions of (x, y) and (x, z). Under the evidently necessary 
condition that the two given distributions have the same 
marginal distribution for the variable x, the class C is not 
empty, and the following statements hold: 


(i) C is a sub-class of the ‘‘ Fréchet class’ of the two- 
dimensional distribution functions which have as 
marginal distributions the distributions of the 
variables y and z. 


(ii) The class C has a maximum and minimum function 
(maximum and minimum in every point). There 
are, however, distribution functions between the 
minimum and the maximum ones which do not 
belong to C. 


If two functions belong to the sub-class C, all their 
linear combinations with non-negative coefficients 
having a unit sum are also in class C, 


(iii) 
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DAVID, H. A. & ARENS, B. E. (Virginia Poly. Inst., BI 
Optimal spacing in regression analysis—In English 


acksburg) 


In the second part of the paper, the class C is studied 
when the associations between x and y and between x and 
& are of particular types. In the case of independence, the 
class C evidently attains its maximum, i.e. it coincides with 
the ‘“‘ Fréchet class”. This can occur also if y and zg are 
not independent of x. If both y and zg are connected to x 
by a distribution function, maximum in the Fréchet class, 
the distribution function of (y, 2), maximum in the Fréchet 
class, belong to C, and it is only the element of C under 
some large conditions. 

It is sufficient, for instance, that the distribution function 
of x be continuous. The same results occur if both the two 
given two-dimensional distribution functions are minimum: 
the maximum function belongs to C, and is the only element 
of C under analogous conditions. 

In the last part of the paper the results found are used 
in order to determine the minimum and maximum values, 
in C, of the moments about the line y =x. Later, the 
minimum and maximum values of the correlation coefficient 
ry are found, i.e. the extreme values of the correlation co- 
efficient between two variables, when each of them is con- 
nected to a third variable by a known law. 


(G. Dall’aglio) 


6.1 (0.2) 


Ann. Math. Statist. (1959) 30, 1072-1081 (8 refs., 2 tables, 1 fig.) 


The bulk of this paper is concerned with the following 
problem: suppose y is a quadratic function of x, where x 
is a variable that is controllable without error and y is 
observable with constant variance o?. It has been decided 
to approximate y as a linear function of x, perhaps because 
of knowledge that the quadratic component is small. The 
problem is to select the values of x so that the observations 
(x;, y;), in an experiment with a fixed number of observations, 
will provide the ‘‘ best” estimate of the actual y value. 

The authors first discuss this problem in general, where 
y is an arbitrary function of x. In terms of the true depend- 
ence of y on x, conditions for optimal spacing of x for a given 
approximating function are derived. They treat the situation 
where the observations y are made without error, and 
obtain two different sets of x values for optimality. One 
corresponds to the condition of closest overall fit, and the 
other to the condition of minimising the maximum deviation ; 
both are satisfied only approximately. With this back- 
ground established, the remainder of the paper is devoted 
to the particular problem already mentioned. 

In the problem of the quadratic being fitted with a 
straight line, the authors show how the optimum position 
can be achieved with only two values of x, and then proceed 
to demonstrate that these two locations, for either criteria 
of the optimum position, are a function of the ratio a’ to 
the true quadratic coefficient in y, where C= Loan, 1 
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being the total number of observations. A table is given 
showing the dependence of the x-locations on this ratio. 
A second table is given which compares the goodness-of-fit 
for the optimally determined x-values with four other choices, 
this being done for each criterion of the optimum position. 
The authors conclude with a brief numerical example. 


(D. H. Shaffer) 


4 


GUTTMAN, L. (Israel Inst. Applied Social Res. and Hebrew Univ., Jerusalem) 


6.5 (6.3) 


Metricising rank-ordered or unordered data for a linear factor analysis—In English 


Sankhyda (1959) 21, 257-267 (19 refs.) 


This paper is concerned with Thurstone’s problem of the 
factor analysis of observed rank-orders with respect to 
having also the underlying factors only rank ordered. That 
is, for a population P of subjects and a set ¥ of tests, the 
metrics of the observed scores are arbitrary up to monotonic 
transformations within each test. The problem is to find 
those scores which will yield a correlation matrix possessing 
properties that are most desirable from some point of view 
of common-factor analysis and which preserve the observed 
rank-orders within tests. 

The author begins with a brief exposition of the approach 
of other investigators, and then he proceeds to solve the 
general problem of assigning real numbers to given qualitiatve 
categories for a given population P in such a way that the 
resulting numerical variables have linear regressions on 
each other. The bivariate case is treated first and it is 
shown that if the original scores already have linear regres- 
sions, there is no profit in trying to transform them into new 
scores. Extensions are made to the multivariate situation. 

In general, it is shown that at most one set of new scores 
can be found to maintain the observed rank-orders. ‘The 
factor analyst has no freedom to mould the new correlation 
matrix by further consideration. ‘The paper also shows 
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how to compute all scoring systems which will yield linear 
regressions, the new scores possibly having polytonic as 
well as monotonic relations with the original rank order. 
The analysis presented does account completely for the 
dependence of tests on each other. The complete sets of 
regression linearising scores always completely reproduce 
the observed pairwise occurrences, or completely “‘ explain ” 
any bivariate dependence. 


(J. C. Reed) 


6.7 (4.7) 


An illustration of the use of stochastic approximation—In English 


Biometrics (1959) 15, 551-559 (4 refs., 2 tables). 


The paper presents an application of the Robbins-Monro 
[Ann. Math. Statist. (1956) 27, 879-886] procedure for 
stochastic approximation, used in regression situations 
where the regression is monotonic. The procedure estimates 
the independent variable x* for which the dependent variable 
takes on a preassigned value, y*. The method is distribution- 
free; the only assumption made in this experiment was 
that the regression was monotonically increasing. 

The experiment was of a sequential type. For the given 
value, y*, of the dependent variable, successive values, 
obtained by formula, were assigned to the independent 
variable, x. ‘This formula is the key to the entire procedure: 
Xny1 = Xn +an(y* —yn)- 

Apart from a reasonable first guess x,, which is not 
crucial, the primary practical problem involved is deter- 
mining the weight, a,. Robbins and Monro proved that 
if the a, are proportional to the series 1, 3, 4, 7, ..., then the 
successive estimates of x* corresponding to y* will con- 
verge. While any constant of proportionality insures con- 
vergence, a wise choice hastens the convergence. 

After the process has been applied, say, 1 times, a further 
repetition of the procedure fails to produce significant 
departure from the value of x* obtained in trial n. 1s 
pointed out that there is no statistical rule for stopping at 
some finite 1 with an associated confidence interval for x*. 
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In addition to the desired value of x* obtained, some 
scanty information concerning the regression curve itself 
is obtained from the points plotted in the vicinity of y*. 

While the Robbins-Monro procedure is only applicable 
to monotone regression situations, other stochastic approxi- 
mation procedures are available; e.g., the Kiefer-Wolfowitz 
method [Ann. Math. Statist. (1952) 23, 462-466] for 
estimating a unique maximum. 

It was assumed that the regression was flat up to a 
certain point, after which the regression increased. The 
authors were quite interested in determining the point at 
which the regression curve ceased being flat and began to 
rise. However, no procedure for obtaining this point is 
available. 

The dependent variable (y) was a ratio of treatment- 
over-control cell counts in a biological experiment. The 
ratio was used in place of a difference because it permitted 
a better guess at the constant of proportionality, a,. It is 
worth noting that should division by zero arise in using 
the ratio, the flexibility of stochastic approximation freely 
permits shifting to a difference statistic. "Two examples 
of the procedure are given. 


(H. R. Baird) 
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HARRIS, E. K. 
Biometrics (1959) 15, 424-432 (8 refs., 2 tables) 


Moving average interpolation for LD, estimates has been 
proposed by Thompson [Bact. Rev. (1947) 11, 115-145] 
and extended by Bennett [¥. Hyg., Camb. (1952) 50, 157-164] 
to include the angular transformation. Formulas and 
tables useful in calculating approximate confidence limits 
- for the LD;, without reference to the angular transformation 
are also available. 

The present paper outlines methods for determining 
“exact” confidence limits and for making significance tests 
under the moving average-angle procedure. The confidence 
limits are not strictly exact since the observed rather than 
the expected proportional response is transformed to an 
angle. ‘The author limits discussion to equally spaced doses 
and equal numbers of animals exposed to each dose but 
states that extension to differing intervals and/or numbers 
of animals is easily accomplished. 

The method consists of applying the arcsine transforma- 
tion to the observed proportional responses, using Bartlett’s 
adjustments when the proportion is 0 or 1. Simple averages 
of k successive angles are computed and each average angle 
associated with the middle dose of the respective set of k 
doses (usually 3 or 5). The LD5o is estimated by linear 
interpolation between the successive doses whose average 
angles bracket 45°. 

Defining A as the fraction by which the dosage increment 
is multiplied in the interpolation process, and under the 
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KRUSKAL, W. H. (University of Chicago) 
Ordinal measures of association—In English 


(Robert A. Taft Sanitary Engineering Center, Cincinnati) 
Confidence limits for the LD 59 using the moving average-angle method—ZJn English 


6.8 (4.4) 


assumption of fixed doses and normal distribution of the 
average angles, confidence limits for the LD;,) are based on 
limits for A determined from Fieller’s theorem. Tables 
list Ay and Ag for A = 0(0:05)0-50 and g = 0-01(0-01)0-10; 
0:10(0-05)0-80 for k = 3 and 5, where g = (Z? Var Y)/Y?; 
Z is the normal deviate corresponding to the two-side 
confidence interval and Y is the difference between the 
bracketing average angles. Extensions are indicated for A 
greater than 0-5 and the author states that more complete 
tables are available on request. 

A test of significance is given for the difference between 
two estimated LD;)’s, based on the assumption that the 
dosage response curves are essentially parallel. Mathe- 
matically the difference between two estimates is expressible 
as the sum of the difference of the pertinent doses and the 
ratio of the difference of the corresponding average angles 
to the common slope of the dosage response curves. Under 
the assumption that both quantities in the ratio are normally 
and independently distributed, a lower confidence limit is 
computed for the ratio. The LD; 9 estimates are judged 
significantly different if, when the lower confidence limit 
on the ratio is substituted for the ratio itself in the expres- 
sion for the difference of the LD; 9’s, the result is greater 
than or equal to zero. A numerical example is given to 
illustrate the method. 


(M. F. Peck) 


6.9 (6.5) 


$. Amer. Statist. Ass. (1958) 53, 814-861 (93 refs., 1 table, 15 figs.) 


The purpose of the author in this review paper is to discuss 

' probabilistic or operational interpretations of the population 
values of several measures of association of bivariate popula- 
tions. The author is especially concerned with ordinally 
invariant (rank) measures and the structure of the bivariate 
population is not generally specified. 

As a background, the author first discusses the standard 
correlation ratio and correlation coefficient. The first 
ordinally invariant measure given is ‘‘ quadrant association ”’, 
to which the author assigns the archaic Greek letter “‘ koppa.”’ 
Following this are introductory sections on Kendall’s “ tau” 
and Spearman’s ‘“‘rho”’. In each case a probabilistic inter- 
pretation of the measure for the population is provided. 
It is notable that for the definition of koppa the probabilities 
of certain relationships among the elements (X, Y) of a 
bivariate random variable are required. For the definition 
of tau pairs of identically distributed independent bivariate 
random variables are considered; while for rho, triples of 
such variables are needed. 

Relationships among koppa, tau and rho are considered. 
Inequalities involving each pair of measures are given and 
all but one of these bounds are shown to be the best possible. 
Values of the three measures are given for the bivariate 
normal distribution (in terms of the correlation coefficient) 
and for a one-parameter family of distributions uniformly 
distributed over portions of the unit square. The author 
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points out that the three measures are comparable in that 
they all measure “‘concordance’’ (as opposed to “‘ con- 
nection ’’), they all range from —1 to +1 and independence 
of X and Y is a sufficient condition that each measure be 
zero. 

Sample estimators of all three measures are given, 
including the more complicated unbiased estimator of rho 
as well as the usual biased estimator. Variances and asymp- 
totically unit normal functions of the estimators are provided. 
References to exact sampling distributions are given. The 
author discusses relative merits of the estimators and in 
particular points out that they differ in their ability to 
discriminate against various alternatives to independence. 

The measures are extended to general distributions, 
considering wholly discrete cases, i.e., cross-classification, 
as well as the continuous form without tied items. Relation- 
ships among the measures in the two cases are discussed. 
A historical survey of ordinal measures of association is 
given in the final section: the list of references is encyclopedic. 


(S. Krane) 
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LANCASTER, H. O. (University of Sydney) 
Zero correlation and independence—In English 
Aust. F. Statist. (1959) 1, 53-56 (1 ref.) 


The author considers the conditions for which zero cor- 
relation between two random variables implies their mutual 
independence. 

The following theorems are proved: 

Theorem 1. If the random variables x,, ...x, are jointly 
normally distributed N(O, V), where their covariance 
matrix V is positive definite, then the necessary and sufficient 
condition for the independence of two linear forms, 
i= Lax; ancy 5e—— 2b 5x4, is that they be uncorrelated. 

v at 


Theorem 2. A necessary and sufficient condition for the 
mutual independence of the random variables x and y is 
that every function of x with finite variance is uncorrelated 
with every function of y with finite variance. 

Theorem 3. If {x}, {y} are complete sets of ortho- 
normal functions defined on the marginal distribution 
functions G(x), H(y) of a joint bivariate distribution F(x, y), 
then a necessary and sufficient condition for independence 
of the distributions of x and y is that (xy) = p,; = 0 
for every pair of positive integers 7, 7. 


(J. Gani) 
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MORAN, P. A. P. (Australian National University, Canberra) 


Regression, correlation and structure—In English 


Aust. Meteorological Mag. (1959) 24, 1-9 (4 refs.) 


This paper sets out the basic statistical theory of the associa- 
tion between two or more measured quantities. The 
distinction is made between a mathematical variable and a 
random variable or variate, and by a simple example the 
concept of the “‘ power ”’ of a significance test is explained. 

The analysis of association is necessarily carried out by 
the use of models and the theory involved in each of four 


such models is explained. The models are: 


(i) the regression model which contains only one random 
variable (y;) whose mean is linearly dependent on a 
mathematical variable (x,;), and the power of the 
associated significance test is increased by examining 
the means of the y,’s at widely separated values of x. 

(ii) The bivariate normal distribution model which 
contains two interdependent variates. Distinction 
is made between the regression line considered as a 
property of the bivariate distribution and alter- 
natively as part of the definition of the regression 
model. 

(iii) The “errors in variables” model in which two 
random variables are measurements involving random 
errors, and the model is used to arrive at an estimate 
of the underlying linear relationship between the 
unobserved structural variables which may be random 
or not. 
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6.2 (2.1) 


6.0 (5.1) 


(iv) Berkson’s model, in which analysis is made of the 


relationship between two quantities. One of these 
is an observed random variable whose means are 
dependent linearly on unobserved random variables 
whose means, in turn, are known specified constants. 


(P. A. P. Moran) 


NASS, C. A. G. (Netherlands Inst. Preventive Medicine, Leiden) 


The x? test for small expectations in contingency tables with special reference to 


accidents and absenteeism—ZJn English 
Biometrika (1959) 46, 365-385 (13 refs., 12 tables) 


When M men are classified with respect to the numbers of 
their accidents (or absences) in n periods (of perhaps varying 
lengths) the frequencies are arranged in an n-row by an 
m-column contingency table. The hypothesis of independence 
is that (relative to the various degrees of their proneness) 
each man presents the same pattern of absences over the 
periods. By the nature of the data the “expected” ceil 
frequencies may be unity or less and this paper investigates, 
numerically, the behaviour of the chi-squared test function 
(here called G) particularly in relation to the approximation 
obtained by fitting a Pearson type III curve to the first 
two moments of G as given by Haldane [Biometrika (1937) 
29, 133-143 and Biometrika (1939) 31, 346-355]. 

After some discussion of various special cases of the 
situation where the probabilities in one margin are known 
(with only one row in particular, and even more particularly 
where in this case the expected frequencies in the cells here, 
according to the null hypothesis, are all equal) the actual 
distributions of G are enumerated for 


(a) two columns (with totals both 20) and 25 columns 
(10 totalling to 1, 15 to 2), and 


(6) 2 columns (totals 20 and 280) and 200 columns 
(100 each totalling 1 and 2). 
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6.9 (5.2) 


The type III distribution is found to give a good approxi- 
mation to the 5 per cent. point in both cases. 

The paper concludes with four applications of the test 
to data on accidents and absenteeism. This is followed by 
a discussion chiefly concerned with the interpretation of 
significantly large values in this field of application. 


(D. E. Barton) 


6.2 (3.6) 


Correlation between sample means and sample ranges—In English 


JF. Amer. Statist. Ass. (1959) 54, 465-471 (3 refs.) 


The correlation between the mean and the range is in- 
vestigated for sampling from symmetric and asymmetric 
populations. It is shown that if a random variable is sym- 
metrically distributed with finite variance, then the mean 
and range of a random sample are uncorrelated. However, 
as is demonstrated by a counter example, zero correlation 
does not imply symmetry. A necessary and sufficient 
condition that the correlation between the mean and the 
range be positive (negative) is presented. It is also proved 
that symmetry of the parent population implies that the 
sample range and midrange are uncorrelated. 


(S. Addelman) 
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PANIZZON, G. (School of Statistics, Padua) 


A generalised multiple linear correlation coefficient—Jn Italian 


6.1 (6.2) 


G. Economisti (1959) 18 (New series), 81-101 (14 refs., 4 tables) 


The author considers measures of the correlation among 
several variables and in particlar the indexes proposed by 
Hotelling, namely: the canonical correlation index and the 
vector correlation index. Following the approach of Levis 
LR. C. Mat. Univ. Roma (1958) 17, 423-428] the author 
starts from a geometric approach to the problem of the 
relations between two groups of variates and calculates the 
coefficient r as a geometric mean of the regression para- 
meters. 

After having considered the properties of the generalised 
multiple linear correlation coefficient, he demonstrates that 
it is equal to the vector correlation index. 

He completes his exposition with an application to 
anthropometric measures, the correlation matrices are 
calculated together with the generalised correlation coefficient. 


(P. Bandettini) 


Editorial Note: see also abstract No. 85, 6.1. 
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PAWLIK, K. (Psychological Inst., University of Vienna) 


6.9 (-.-) 


The maximum coefficient of contingency when the numbers of rows and 


columns are unequal—In German 


Metrika (1959) 2, 150-166 (10 refs.) 


The contents of this article may be described shortly by a 
quotation of Cramér: ‘‘ The mean square contingency .. . 
Seats: 
noes yy (Din —PirP-x)” es Dix” > 
i,k Di-D-x i,k Di-D-x 
(pix = probability that an individual belongs to the 7th 
row and kth column) 

“|. . by means of the inequalities p;,<Sp;. and pi, Sp.; 
it follows from the last expression that ®?<q—A, where 
q = Min(m, n) denotes the smaller of the numbers m and 
n, or their common value, if both are equal. Further, the 
sign of equality holds in the last relation if, and only if, 
one of the variables is a uniquely determined function of 


the other. Thus 
Algae eae: 


[see p. 282 of Mathematical Methods of Statistics (1946)]. 
Instead of using ®? which is defined by 


C = Le = 5 ea 
NE X= Acie 
72 Vik 


where f? = a —1, when Puy and v,;, is the 
1k PiePric 
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number of individuals in the place (7, k), N the total number 
of individuals. ‘Therefore f? is the sample characteristic 
corresponding to ®?, 

It is shown that the results must be the same as cited 
above. The proof given is not exact in the mathematical 
sense, but is substantially true and may be made rigorous 
without difficulty. 

The author finally discusses a ‘‘ measure of correlation ” 
in a contingency table as introduced by Gebelein. Since 
Gebelein starts from a quite different point of view, there is 
no connection between the two methods. 


(K. Weichselberger) 


Remark of the Regional Editor: 
The correction of several misprints in this paper appear 


in Metrika (1959) 2, 256. 


(J. Pfanzagl) 
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SIBUYA, M. & HAGA, T. (Inst. Statist. Math. and Sano Pulp Co. Ltd., Tokyo) 


6.1 (11.1) 


Orthogonal polynomials without constant term—In English 
Ann. Inst. Statist. Math., Tokyo (1959) 10, 209-222 (2 refs., 2 tables) 


The problem of fitting curves which pass through the 
origin requires the numerical values of orthogonal poly- 
nomials which have no constant terms, in the same way 
as the problem of fitting general curves requires Tchebysheff- 
Fisher polynomials. With this in mind, the authors give 
expressions of such polynomials and tabulate the numerical 
values related to them. 

Let N be a positive integer and XN)(¢) (I eee ND) 
be polynomials in x of degree n whose constant terms vanish. 
These polynomials are required to have the orthogonal 
property with respect to N equidistant points; that is, 
the summation over x = 1, 2, ..., N of XN) XN) (0) is 
required to vanish for any distinct two integers n and m. 
This property determines X CN) (se) up to constant multipliers. 
Using some quantities calculated with the aid of these 
polynomials, the coefficients of a regression model which 
is expressed in the form of the sum of a polynomial of 
degree n and a homogeneous error term are estimated in 
the sense of least square. The quantities required for this 
purpose are the values of orthogonal polynomials X Wy (x) 


and their norm S‘%)—the square sum of X ve (x) over 
ete cae, IN ; 

The explicit forms of XN) and Se oe == tle WY, By 4 
are given: for general n, the recursive formulae of X Gy (x) 
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TEGHEM, J. (Université Libre de Bruxelles) 


Confluence analysis—In French 


are given. X\)(x) is also expressed as the nth difference 
of a certain polynomial of degree 2x. The numerical values 
Xx) for n = 1, 2,3,4; N = 2(1)23, x = 1(1)N, with the 
values of Ss ) are tabulated in Table 1, and the coefficients 
of X)(«) in Table 2. 

The authors also consider the heterogeneous case where 
the standard deviation of the error term in the regression 
model is proportional to x, which can be reduced to the 
homogeneous case. 


(K. Isii) 


6.4 (4.8) 


Cahiers du Centre d’Etudes de Rech. Operat. (1959) 3, 5-34 


The purpose of this paper is to show the object and methods 
of confluence analysis, with as little technical detail as 
possible. 

Using Markoff’s theorem, the author first recalls the 
mathematical model to which the least squares method may 
be applied in the problem of estimating a structural linear 
equation. Among other things, it is assumed that there is 
simple structural collinearity and absence of errors upon 
the explanatory variates. 

As Frisch emphasised, the joined failure to obey these 
two hypotheses may lead to fictitious determinations, without 
sense, if the least squares method is applied. The author 
develops these remarks by substituting Koopmans’ prob- 
language for Frisch’s non-probabilistic one 
[Koopmans, Linear regression analysis of economic time series, 
Netherlands Economic Institute (1937) Haarlem]. He 
states the model used by Koopmans in this last work. In 
this model, all variates are liable to errors, but simple 
structural collinearity is still assumed, so that Frisch’s 
criticisms concerning multicollinearity remain valid. When 
there are no errors multicollinearity would clearly appear 
in the estimations by the determination of the latter. The 
presence of errors conceals this indetermination as errors 
have what Frisch calls a “‘ cushion effect”. The case of 
simple collinearity close to multicollinearity, are also sensitive 
to the “‘ cushion effect ”’. 
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‘The author then shows how Frisch achieves the following 
three aims: (i) to test simple collinearity, (ii) to obtain 
spaces answering simple collinearity, and (iii) to estimate, 
in each of these spaces, the coefficients of the corresponding 
structural equation. He recalls Frisch’s non-probabilistic 
model, infers fundamental properties and explains the 
method. The simple collinearity test and the obtaining 
of spaces answering this collinearity are based upon sub- 
jective comparison, for each pair of variates (x,, x;), of the 
regression coefficients of x; on x; in the various spaces to 
which the pair belongs. In relation to every couple, the 
other variates are classified into useful variates, superfluous 
variates and determinental variates. Each of the possible 
spaces of the variates is then studied in its whole, according 
to the nature of the variates with regard to the couples of 
the space. These comparisons and this study are made 
with the help of numerous graphs and tables, the bunch 
maps and the star maps. 

Only those spaces are retained in which all variates are 
useful with regard to all the couples of other variates in 
the space. In them and in them only, is set the problem 
of estimation. Frisch solves this problem by using his 
diagonal regression. The lack of precision of the estimate 
is expressed by a factor of significance. 


(J. Teghem) 
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UEMATU, T. (Inst. Statist. Math., Tokyo) 6.4 (0.1) 


Note on the numerical computation in the discrimination problem—In English 
Ann. Inst. Statist. Math., Tokyo (1959) 10, 131-135 (3 refs.) 


The author attempts to simplify the numerical computation equation to the solution of some secular equation with 
of the secular equation, the solution of which is needed very small dimension. This reduction simplifies the 
in the discrimination problem. This secular equation has, numerical solution of the equation. 
on each side of its equality, two matrices: the one is positive 
definite, and the other is symmetric and singular. The (Y. Suzuki) 
usual computation method for solving the equation contains 
the computation of the inverse matrix of the positive definite 
matrix and its multiplication with the other matrix. By 
these operations the equation is reduced to the usual form 
and solved. 
In the actual discrimination problems, especially in the 
quantification problem, the dimension of the equation 
concerned is usually very high, so that the above-mentioned 
operations of matrices are very troublesome. In the develop- 
ment of this situation some simpler methods are presented. 
In most practical cases the number of groups, into which 
the discrimination is carried out, is very small. The rank 
of the above-mentioned singular matrix is less than the 
number of groups, so that it is very small. More precisely, 
this singular matrix is given by the product of some matrix 
with the transposition of it, the number of rows of the 
matrix being not greater than the number of groups. This 
makes it possible to reduce the solution of the original 
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WEICHSELBERGER, K. (Sozialforschungsstelle, Universitat Minster) 6.9 (4.8) 
The estimation of parameters for contingency tables with marginal totals 
given in advance (I)—In German 


Metrika (1959) 2, 100-130 (4 refs.) 


The paper deals with the problem of estimating cell The proof given is synthetic, i.e., a solution is constructed 
frequencies of contingency tables, when the exact totals of by successive approximation. Finally it is shown that the 
rows and columns are known in advance and a sample is solution is unique. 

taken to obtain information on the cross-classification. As 


the totals of rows and columns in the sample are not usually (J. Pfanzagl) 
in accordance with the data given in advance the question 
of adjustment arises. Editorial Note: see also abstract No. 445, 6.9. 

Results are obtained for the case of an infinite universe. Weichselberger 


In this case the estimation of the cell-probabilities p;;, is 
bounded by the marginal-probabilities given. Maximum 
likelihood estimators are obtained as p;, = Ajx/ce;+d,, where 
h;. is the relative frequency in the sample and the c; and d, 
are Lagrange-multipliers which are determined by the side- 
conditions giving the relation to the marginal probabilities. 
It is shown that this system of equations has one and only 
one solution with all 6;,=0, provided there does not exist 
a row iy so that h;,, = 0, for all k, for which h;,>0 in any 
other row. If this condition is not fulfilled, there always 
exist p;, and p.,, so that the system has no solution. But 
it is possible, of course, to use x;, +1, instead of x,; in these 
cases, without changing the asymptotic properties of the 
maximum likelihood solution. 
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WEICHSELBERGER, K. 


(Sozialforschungsstelle, Universitit Minster) 


6.9 (4.8) 


The estimation of parameters for contingency tables, with marginal totals 


given in advance (II)—In German 
Metrika (1959) 2, 198-229 (12 refs.) 


At first, the possibilities of solving the system of maximum 
likelihood equations obtained in the first part of the article 
is discussed. 

The method of successive approximations is used to 
calculate the solutions. The elements of the approximating 
sequences fulfil the relation £,;, = h;,;,/c;-+d,, but do not 
equal both row totals and column totals. One step approxi- 
mately fits the row totals, the next the column totals, and 
so on. The convergence is proved using the main result 
obtained in the first part of the paper. 

Three different methods fitting the totals approximately 
are discussed; they are different in the complexity of the 
single steps and in the rate of convergence. Their qualities 
are illustrated and tested by means of an example. 

Deming and Stephan have proposed a solution to the 
problem of contingency tables with fixed marginal totals 
by means of a least squares adjustment. The method leads 
to a system of equations p;, = h;,(1+y,;+5,) together with 
the conditions concerning the marginal probabilities. The 
solutions of this system are discussed; they are found to 
differ from the maximum likelihood solution only by terms 
of second and higher order in (p;,—/;,.) and (p,—A4). 

All proofs of convergence concerning the calculation 
by iterative adjustment in this case can be given by means 
of the results of Part J. 
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WETTE, R. (Zoolog. Inst., Universivie Heidelberg) 


Finally, the method of iterative proportional adjustment 
—which constitutes a third kind of solution—is discussed. 
It leads to pi, = hipCiDy. 

The solution of this system can also be calculated by 
iterative methods and the proof given in Part I remains 
valid. The solution is also of theoretical interest, as it 
can be shown that it uses only the information contained 
in the sample without regard to the marginal probabilities. 
Under certain rather general conditions the distribution in 
a contingency table is determined by two sets of parameters. 
One of them gives the marginal totals, the other one gives 
the remaining information, i.e. the association within the 
table. The method of iterative proportional adjustment 
combines the second set of parameters estimated from the 
sample with the totals given in advance. The author states 
that this solution is to be preferred, if it is suspected that 
there is a bias in the totals of the sample. 


(K. Weichselberger) 


Editorial Note: see also abstract No. 444, 6.9. 
Weichselberger 


6.0 (6.1) 


Analysis of regression and causal relation in biology—IJn German 


Metrika (1959) 2, 131-137 (27 refs.) 


A concise presentation of problems in the analysis of relations 
between biological variables is given, together with many 
references to the pertinent literature up to 1956—the time 
when the paper was originally read by the author. A clear 
distinction is made between two fundamentally different 
problems in biometry. 

The relation between two or more variables can be 
adequately analysed by the standard methods of regression 
and correlation analysis, if and only if the object is a mere 
description of the relations between observational data. 
Different methods, however, have to be applied if the 
structural relation is at issue, i.e., the relation which is 
postulated to cause the empirical relation as observed in a 
sample. Whereas there is only one single structural relation 
between two biological variables, ordinary regression 
calculation yields two regression lines. Consequently the 
parameters estimated by this method are not identifiable 
with the parameters of the structural relation. 

Various problems are discussed briefly and an outline 
is given of appropriate methods in order to arrive at con- 
sistent estimates of the linear structural relation between 
two biological variables. The case of both variables being 
subject to error is also considered. The review of the 
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literature on each of the problems discussed will be 
appreciated by those who desire further information on the 
subject. 


(F. X. Wohlzogen) 
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DAVID, FLORENCE N. (University College, Tendon) 


7.5 (3.1) 


The z-test and symmetrically distributed random variables—In English 


Biometrika (1959) 46, 123-129 (6 refs.) 


Some years ago David and Johnson [Ann. Math. Statist. 
(1951) 22, 382-392] gave the exact cumulants and cross- 
cumulants of the distributions of the sums of squares, 
usually associated as test-criteria for the general linear 
hypothesis, under extremely wide assumptions. It had been 
intended to use these exact values to explore arithmetically 
the effect on the F (or 2) test of departures from the basic 
assumptions of the analysis of variance, but the amount 
of arithmetic involved was intimidating. In this present 
paper, techniques are described which considerably reduce 
the exploration of departure from normality etc., as far as 
the calculations are concerned. The case of symmetrically 
distributed variables in the one-way classification is the 
only one discussed, but it is pointed out that the techniques 
are applicable in any of the analysis of variance set-ups which 
are special cases of the general linear hypothesis. 

For normal variables, 22 = log,F is approximately 
normally distributed for degrees of freedom which are not 
small and even for small degrees of freedom can be reasonably 
represented by the Edgeworth form of the Gram-Charlier 
series. It may be assumed when the variables are not 
normal but the divergences are not too marked that the 
distribution of zg may still be represented by this series. 
Writing z, for z derived from a normal population, and 
za for that from a non-normal population, a series expansion 
is obtained for z4 about a given significance level of 2. 
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HALL, W. J. (Univ. of North ee tan Chapel Hill) 


It is shown that for moderate divergences in the parent 
population the effect on the nominal significance level is 
not large. 

The case of unequal group sizes is also discussed. It 
is pointed out that the effect of unequal group sizes is to 
make the mean and variance of z4 more nearly those of z, 
and that calculations for the effect of non-normality on the 
distribution of zg are therefore best carried out under the 
assumption that all the groups are of equal size. 

The inverse series of Cornish-Fisher is used to obtain 
comparisons between z4 and z, at the « per cent. level of 
significance. The difference between the two values for 
a given level of significance is shown to be of small order 
and substantially in agreement with that given by Gayen 
[Biometrika (1950) 37, 236-255]. 


(Florence N. David) 


7.7 (5.4) 


The most-economical character of some Bechhofer and Sobel decision rules—In English 


Ann. Math. Statist. (1959) 30, 964-969 (10 refs.) 


Bechhofer has considered a single-sample multiple-decision 
procedure for choosing, among a group of normal popula- 
tions with common known variances, that population with 
the largest mean; and, with Sobel, a procedure for choosing 
the normal population with the smallest variance. Several 
other analogous problems have also been considered. 
[Biometrics (1958) 14, 408-429.] They suggest, with only 
intuitive justification, choosing the population with the 
largest (smallest) sample mean (variance), and give tables 
for finding the minimum sample size (assumed equal for 
all populations) which will guarantee a correct decision 
with prescribed probability when the extreme population 
parameter is sufficiently distinct from the others. This 
paper gives justification for a wide class of such procedures, 
proving that no other rules can meet this guarantee with a 
smaller (fixed) sample size; that is, such rules are most 
economical [Ann. Math. Statist. (1958) 29, 1079-1094]. 
Proof of the most-economical character of these rules is 
achieved by proving their minimax character when a suitable 
loss function is introduced. 

The main result is applicable to any analogous problem 
of choosing the population with the most extreme parameter 
when, for each sample, there is a sufficient numerical statistic 
with a monotone likelihood ratio and the (numerical) para- 
meter is a location or scale (but not range) parameter in 
the distribution of the statistic. It is extended to parameters 
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which are not location or scale parameters, but the guarantee 
then holds only at a specified location; for many problems, 
a least favourable location can be determined so that the 
guarantee can be made to hold irrespective of location. 
For example, the procedures of Sobel & Huyett [Bell Syst. 
Tech. F. (1957) 36, 537-576] for choosing the largest of 
several binomial parameters are proved to bemost economical. 
The broader optimum properties of these latter procedures, 
for choosing the population with the largest ‘‘ survival 
probability ”’ irrespective of the form of the distribution, is 
also discussed. Extensions to problems of ranking the 
populations according to the parameter values, or of grouping 
them by ranks, are also given. 


(W. J. Hall) 


Editorial note: see abstract No. 96, 7.7. 
Hall 


/| 


it 
i] 


7 
iv 


it 


Miz eee Sins sath. hy iy i 
i : 7d Ty toake we Same Dr uM eee 
) ei) AE ade alle lle 

* =) aaa es Seine 

° Ks vole! Te ie Ser as %: a! aah a 
pali De 010 0 hate te Sei 
iy i t a) hans Oia Cg iaite , 
3 ols le aie eae 
Pe ey ~ Rtkaccsis 


y A 
" . TAU Ea D 544 


— (5, tie ’ Rieti i ‘ 
} Pee Ne ae: iO 

2 ere eek 

= tae : eat ht Seah ani 
; ee ee 


* 


= ; = el ms “ ‘ sm) als TF pend a 
Neha ed ade oan kil ate 


Ley Sok) gtk Ape yal gaa eat «ay ne en 
“iy ) Dees aA Saaeica a wie ‘a 


ase. Ode ie de Ly uls is 


hi LR 
a > wt Pee aoe 7 Date ry 
eh aM aA ieee ate | 
, ; * s 
¢ oA sie: 
t beta 
it ray as 45 
(4 i © babys 
r - ' er 
ity. aye nae 
ye 4 f ce nu 
fs ‘ <a ee ty 
' we! Via eee We 
oes AT it pate Pes, 
“ree “a! Shiep aaa _ i 
yes <P: soniye 
oO (Mi yh) ae ai aba 
ta ey es, ae 
‘ 4 UP aoe vey 7 a 
fe & “Sei See fi : FA ‘ahs 


a! 


np ie rag iaten 

iy *) cea és ie ot) 
Habe? saa )'s 7 i re Lia’ 
oe fk) Se ee RUA Ff 


MnCl eves ae a he i ; 
aes tah a8 ee onan 


F ii 
“a 


ot hay a8 hee 


“1 again pak ; rat : 

ae aD irda i nak 

ihegietine sal : 

> ren Feet pryres hs 
, 7 =F 7 - 5 ( fi 

re r . ral y , ps 


‘7 


HENDERSON, C. R., KEMPTHORNE, O., SEARLE, S. R., & von KROSIGK, C. M. 


(Cornell Univ. & Iowa State Coll.) 


7.3 (7.4) 


The estimation of environmental and genetic trends from records subject to culling—In English 


Biometrics (1959) 15, 192-218 (5 refs., 3 tables) 


The formal application of least squares to the estimation 
of parameters in the presence of selection leads to biased 
estimates; the extent and direction of this bias depends on 
the method of analysis employed. The authors discuss this 
situation for a special example, i.e. the estimation of pro- 
ductivity trends due to heredity and environment, in the 
presence of culling. They propose two more appropriate 
methods for the analysis of such data. Both methods regard 
as fixed effects a “‘ general mean production ability” and 
the “‘ environmental effect of the kth year”’. In the first 
method a “ group effect’ of cows-in-a-herd is added in 
which groups may constitute daughters of the same bull, 
groups entering the herd at the same time, etc. ‘The other 
method specifies this ‘‘ group effect”’ as “‘ the amount by 
which the group of individuals entering the herd in year t 
deviate.”” Both methods regard the hereditary effect, the 
“producing ability of a cow,” as a random variable with 
zero mean and equal variance oe) and independent of the 
error (variance o”). In the first method (Henderson), the 
“* repeatability ’? index, r = o? |(o? +07) is assumed to be 
known, and thus the normal equations obtained, for the 
maximum likelihood estimation of fixed effects, have the 
familiar form of a mixed model analysis. Consequently, 
the estimation procedure is quite straightforward; it is 
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MITRA, S. K. (Indian Statistical Institute, Calcutta) 
Some remarks on the missing plot analysis—In English 
Sankhyda (1959) 21, 337-344 (9 refs., 4 tables) 


Several authors (Neyman, Welch, Pitman, Kempthorne, 
Wilk and others), while investigating the possibility of 
validating the customary F-test in the analysis of variance 
for designed experiments, have considered the stochastic 


character of the observed variables as due primarily to the 


random assignment of the treatments to the experimental 
units. It is possible (theoretically, at least) to write down 


_ their joint distribution as soon as the randomisation procedure 


R is specified. Their researches tend to show that the 
F-test is unbiased in the sense that under the null hypothesis, 
both the numerator in F (the treatment mean-square) and 
the denominator (the error mean-square) have the same 
expected value. 

The object of the present paper is to demonstrate that 
this is no longer true with the analysis-of-variance [ test 
when the yields on some of the units are missing. For 
simplicity, only the cases of the randomised block and 
latin square experiments with a single missing observation 
are discussed. For computing the expected values, in- 
dependent repetitions of R (with the same set of experimental 
units reporting missing yields each time) are considered. 

The author remarks that such a bias possibly affects the 
test of equality of treatment effects only in so far as the use 
of the percentage points of the F-distribution for judging 
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described in detail and illustrated with a hypothetical set 
of data. The second method (Kempthorne and von Krosigk) 
assumes that 7 is not known and must be estimated from 
the data. The maximum likelihood method in this case 
leads to a complicated set of equations which, the authors 
suggest, should be solved by the Newton method for several 
variables, with perhaps certain simplifications similar to 
those used in bio-assay. The presentation of the maximum 
likelihood solution in this method should be of special 
interest to proponents of the ‘‘ recovery’ analysis. The 
juxtaposition of the two methods shows, not only in this 
case, how the simple analysis of a mixed model, under the 
(unrealistic) assumption that the population variances are 
known, must be replaced by a quite complicated analysis if 
the two types of variances are to be estimated. The second 
method is illustrated by data of the Iowa Board of Control 
herd at Woodward. In the final section, the fourth author 
(Searle) shows the obvious equivalence of the two methods 
when r¢ is known. Henderson uses two types of derivations 
(general linear hypothesis and ‘‘ step-down ’’), whereas 
Kempthorne and von Krosigk use the ‘“‘ step-down” 
technique. 


(R. E. Bargmann) 


7.4 (—.-) 


the significance of the computed F will either under-estimate 
or over-estimate the level of the randomisation test. The 
bias may be unimportant when the number of missing 
observations is relatively small. Further, if we can regard 
the missing plot as one chosen at random from all the plots, 
the bias vanishes in both the cases considered in this paper. 


(K. Rajaraman) 
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MODE, Cc. J. & ROBINSON, H. F. (Montana State Coll., Bozeman and N. Carolina State Coll., Greenboro) 7.2 (6.2) 
Pleiotropism and the genetic variance and covariance—In English 


Biometrics (1959) 15, 518-537 (5 refs., 3 tables) 


This paper extends the concepts of additive and dominance 
components of the genetic variance to genetic covariance. 
Genetic, genotypic, and phenotypic coefficients of correla- 
tion are defined and a method of estimation is given under 
the assumption that a gene affects unrelated characters; 
that is to say, its action is pleiotropic. To estimate the 


above coefficients of correlation and the average degree of 


dominance, it is necessary to perform an experiment where 
a random sample of males is mated to a random sample of 
females, each female is used once and only once. 

Since the usual arguments would not hold in obtaining 
genetic covariance between different characteristics, the 
authors derived the variances and covariances of these 
means by direct calculations. Explicit formulae are given 
in terms of expectations and in terms of genetic components 
for all variances and covariances. 

Design I of Comstock and Robinson [Biometrics (1948) 
4, 254-265] is reviewed and all formulae required to obtain 
the analysis of variance are given. The expectations of the 
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several mean squares are presented and estimates of the 
various correlations are given under the absence and presence 
of epistasis. Variances of the correlations are obtained so 
that the reliability of the estimates may be obtained. 

A numerical example is provided to illustrate the various 
computations that are involved. 


(C. Y. Kramer) 


7.2 (4.1) 


The sampling variance of the genetic correlation coefficient—In English 


Biometrics (1959) 15, 469-485 (4 refs., 4 tables, 1 fig.) 


Genetic correlations may be estimated by (a) analysis of 
parent-offspring correlations or (b) analysis of variance and 
covariance components within and between relatives. 
Formulas for the sampling variance have been derived for 
(a), but not for (6). The author presents formulas for the 
sampling variance for (b), taking the special case in which 
heritabilities are equal for a given pair of characters. 

Both variates are transformed into standard measure, 
then cast into a single analysis of variance. ‘The between- 
group variance is partitioned into a mean square for groups 
and one for group-by-character interaction. Similarly, 
the within-group variance is divided into mean squares for 
individuals and for error. Variance of the genetic correlation 


- coefficient is derived as a function of these four mean squares. 


The resulting formula then is converted to one which employs 
group size, number of groups, the intra-class correlation, 
and the within- and between-group correlations. 

If the two variates under consideration are measured in 
different individuals, the estimated variance may be biased 
upward or downward, depending on group size and the 
magnitudes of the within- and between-group correlations. 
If both group size and the intra-class correlation are large, 
the variance of the genetic correlation approaches that of 
the correlation coefficient itself. If the genetic correlation 
approaches zero, its sampling variance increases, but can 
only be doubled in the most extreme cases. 

The possibilities of a general solution, for cases in 
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which the two characters have different heritabilities, are 
discussed. It is presumed that the appropriate formula 
will contain some joint function of the two heritabilities in 
its denominator. 

A special use of the genetic correlation coefficient is to 
detect genotype-environment interactions. If the genetic 
correlation between performances in two different environ- 
ments is less than unity, and the respective between-group 
variance components are not different, an interaction may 
exist. The biological importance of this interaction depends 
on (i) the deviation of the genetic correlation from unity 
and (ii) its standard error. The estimation of such standard 
errors is discussed, both within populations and over wider 
genetic bases. The case of more than two environments is 
also considered. In general, optimum group size in each 
environment is more than twice as large as for estimation of 
heritability. f 

Optimal structures of experiments for estimation of both 
heritabilities and genetic correlations appear to be the 
same. However, the sampling variance of the genetic 
correlation coefficient will be much larger. 

The author discusses limitations of the foregoing 
approach. Primarily, the assumption that the two characters 
have the same intra-class correlation. Secondly that the 
genetic correlation coefficient is a ratio, and therefore its 
variance should be estimated from large samples. 


(N. R. Thompson) 
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SRIVASTAVA, A. B. L. 


(Indian Institute of Technology, Kharagpur) 
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7.5 (3.2) 


Effect of non-normality on the power of the analysis of variance test—In English 


Biometrika (1959) 46, 114-122 (8 refs., 3 tables, 1 fig.) 


The author discusses the power of the F-test for the one- 
way classification set-up when there are k groups of n 
observations. The parent population is assumed to have 
a distribution which can be represented by the first four 
terms of an Edgeworth series. The values of the parameters 
of this series are restricted so that the distribution has a 
functional form which is positive, definite, and unimodal. 
The method of attack on the problems is that which has 
been made familiar in a series of papers by Gayen [see, 
for example, Biometrika (1949) 36, 353-369 and Biometrika 
(1950) 37, 236-255]. 

The author finds that skewness of moderate order has 
little or no effect on the power of the analysis of variance 
test for the one-way classification considered. ‘The effect 
of kurtosis on the other hand, is marked, especially for small 
samples. When the population is leptokurtic the effect is 
to increase the power near the significance level point and 
to cause it to decrease, by comparison with normal theory, 
in the region where the normal theory power is high. The 
reverse is true for platykurtois. Any effect appears to 
decrease in importance with increasing sample size. The 
general conclusion is that the effect of non-normality on 
the power function is of little consequence if the parent 
population is not widely divergent from that of normal. 


(Florence N. David) 
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7.8 (5.6) 


A multiple comparison rank sum test : treatments versus control—IJn English 


Biometrics (1959) 15, 560-572 (5 refs., 3 tables) 


In this paper the author presents a non-parametric test 
which is an analogue of Dunnett’s procedure [¥. Amer. 
Statist. Ass. (1955) 50, 1096-1121] and also a generalisation 
of Wilcoxon’s test [Biometrics (1945) 1, 80-83]. The non- 
parametric nature of the test enables one to by-pass the 
usual analysis of variance assumptions. The test itself is 
a rank-sum multiple comparison test applicable in situations 
where there is a control and a number of treatments in a 
one-way classification; the number of observations on the 
control and each treatment is the same for all groups. 

The proposed test is on the null hypothesis that the 
control and treatment variables have identical cumulative 
distribution functions. This hypothesis is tested against 
one of the following alternatives: 


(i) At least one treatment variate is stochastically 
smaller than that of the control; 
(ii) At least one treatment variate is stochastically larger 
than that of the control; 
(iii) At least one treatment variate is stochastically 
different from that of the control. 


All the cumulative distribution functions are by assumption 


continuous. 

The measure for a single comparison is the minimum 
of the rank-sums on the control and a particular treatment 
when the two sets of values are ranked together in a single 
array. ‘The proposed test statistic is then the minimum 
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over all treatments of these minimum rank-sums. For a 
given experiment its empirical value is compared with the 
tabulated value as determined by a preassigned significance 
level a, the number of treatments k, and the number of 
observations on each treatment n. 

In Section 4, the author discusses the exact distribution 
of min{min(T,, Ty, where JT; and Te denote the rank- 
sums on a control and the 7th treatment variables. This is 
followed by a table with exact probabilities for k = 2 and 
3 and 2 = 3,4,5. In Section 5, it is assumed as an approxi- 
mation that (7,, T., ..., T;), where T; is the sum of the 
ranks assigned to the control variable when ranked with the 
ith treatment, is from a multivariate normal distribution 
and that [min T;—E(T,)]/[var (T))]}* is distributed approxi- 
mately as Dunnet’s ¢ with infinite degrees of freedom. 
Under the above assumptions, the author obtains approxi- 
mate significant values of rank sums for both one-sided 
and two-sided alternatives at confidence level 1—a = 0°95 
and 0:99. The tabulated values are for cases where the 
number of treatments (excluding control) is k = 2(1)9 and 
the number of observations on each treatment is n = 4(1)20. 

Section 3 is devoted to an example based on data obtained 
from the University of Wisconsin School of Medicine. 
Here, the author applies his work in testing which of three 
treatments results in lowering IQ scores below normal, and 
then demonstrates that application of Dunnet’s procedure 
leads to the same conclusion. 


(B. J. Trawinski) 
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van ELTEREN, PH. & NOETHER, G. E. (Math. Centre, Amsterdam and Boston Univ.) 


7.7 (5.1) 


The asymptotic efficiency of the x7-test for a balanced incomplete block design—ZIn English 


Biometrika (1959) 46, 475-477 (8 refs.) 


This paper is concerned with the asymptotic relative 
efficiency in the sense of Pitman [see for example, Hannan, 
J. R. Statist. Soc. B (1956) 18, 227-233] if the number of 
blocks tends to infinity of a distribution-free test for a 
balanced incomplete block design due to Durbin [Brit. Fe 
Psychol. (1951) 4, 85-90] with respect to the corresponding 
analysis of variance test. It is assumed that the underlying 
variables have distributions that may differ only in location 
parameters depending on block and treatment number. If 
these distributions are normal, the asymptotic efficiency 
E = 3k/m(k+1) and in general E depends only on the 
block size k and not on the number of treatments 7. 
The asymptotic efficiencies of the following tests are 
obtained as special cases 
1.k=n. Friedman’s xz-test for designs with two 
classifications, also known as the method of m rankings. 
2. k= 2. A test for the problem of paired comparisons 
asymptotically equivalent to a test of Bradley & Terry, 
discussed by Bradley [Biometrika (1955) 42, 450-470]. 
Sak = 7— 2, The'sien test. 
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Earlier results of Bradley in the reference quoted above | 


for case 2 and of Hodges & Lehmann [Ann. Math. Statist. 
(1956) 27, 324-335] for case 3 do not quite agree with the 
results obtained in this paper, but it is shown that the 
differences are due to modifications in the methods of 
comparison. 


(Ph. van Elteren) 


7.0 (2.2) 


Computation with rounded or transformed values, especially in analysis of variance—In German 


Biom. Zeit. (1959) 1, 190-202 


This paper contains rules for fixing the accuracy of measure- 
ment or the rounding-interval in those cases where the 
variance of observed values is to be computed or where the 
values are to be subjected to an analysis of variance. ‘The 
application of these rules leads to a considerable reduction 
of costs involved in the observation of the values and their 
evaluation. 

The exact values x may be distributed around the 
expected value pu with the variance o”. If x«* stands for the 
rounded value corresponding to x, and ¢ for the greatest 
possible difference between two corresponding rounded 
values, the rounding error y = x*—vx lies in the interval 
—c/2<y<-+c/2. On the assumption that y is stochastically 
independent of x and that it is uniformly distributed in this 
interval, the expected values of x and x* are equal. The 
variance of the rounded values is given by o*? = o?+c?/12. 
Accordingly, the relative increase of the variance due to 
rounding is: U = (o*#—o?)/o? = 1/12(c/c)*. If a specified 
enlargement U is admissible, the rule c = o/12U ensues 
for c. If, for instance, U = 1 per cent., then c = 0:35 ce. 
Assuming that the values of measurement lie in a range 
of 5a, it is sufficient to distinguish approximately 5 : 0-35=14 
classes of values in the computation. 

In computing the variance from many values it is common 
practice to classify the values according to approximately 
10 to 20 classes. According to the above considerations, 
this simplification can also be utilised in those cases where 
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only a few values are available. It is suggested that the values 
should be coded, say, between —10 and +410 and to 
round the transformed values off to integers. With the 
help of the values coded in this way it is easy to compute 
the mean value and the variance; the variance thus computed 
exceeds the variance of the original values only by the given 
proportion U. 

This procedure can also be employed in the analysis of 
variance. Let of denote the total variance and U for the 
admissible proportion by which of may be enlarged by 
rounding. If G-+1 specifies the order of magnitude of the 
variance-ratios F’, the following relation holds for a 2-way 
variance-analysis with interactions: ¢>o;V12U/V1+2G/n, 
where nm denotes the number of members in each cell. In 
an v-way variance-analysis, in which interactions are taken 
into account only up to the second degree, the following 
approximation applies: c>o,V12U/V1+4G/n. On the 
assumption that U = 1 per cent. and G = 7, it is sufficient 
to use two digit integers lying in an interval of length 60. 
It is suggested that the original values are coded to integers 
which lie in the range +30. 

By using the example of a 3-way variance-analysis it is 
shown numerically that the differences between the F-ratios 
of the exact values and of the coded values are relatively 
small. The slight extra work involved in the coding is 
largely compensated by the advantage of having convenient 
figures to work with in the analysis. 


(K. A. Schaffer) 
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AROIAN, L. A. 
Quality control charts by gauging—Jn English 


(Hughes Aircraft, Culver City, California) 


8.9 (4.1) 


Nat. Conv. Trans., Amer. Soc. Qual. Contr. (1959) 145-153 (4 refs., 4 tables, 2 figs.) 


It is assumed that a characteristic, x, of an item is normally 
distributed. Where exact measurements of x are expensive, 
it is suggested that gauging be used. The first gauge will 
be used to classify all items less than x, and a second gauge 
to classify all items above x,. Let p, g and r represent the 
true proportion of the population below x,, between or 
equal to x, and x2, and above xp, respectively. 

In a sample of m items, let a represent the number of 
items below x, and c represent the number of items 
above x,. "Thus (c—a) is an indicator for a shift in the 
mean and (c-++a) is a measure of variability. Control charts 
can be constructed based on these two statistics. 

Since the underlying distribution is normal, p = r is 
chosen for control of the mean. Tables for (c—a) are given 
where p ranges from 0-05 to 0-40 for samples of size n = 8, 
10 and 12. Corresponding to each value of 9, for a particular 
n, the control limits and the probability, 2«, that (c—a) 
will exceed the control limits when the process is in control 
are given. Control limits for values of 2« between 0:002 
and 0:05 are listed. For each combination of p, n, and 2«, 
the sample size required by using the ordinary * chart is 
given. ‘The sample size was determined such that approxi- 
mately the same operating characteristic curves were obtained 
for the (c—a) chart and the * chart. The comparison is 
made for the *« chart where the estimate of the standard 
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deviation is obtained from the range, R. Efficiencies up to 
approximately 75 per cent. of the (c—a) chart compared 
to the * chart were obtained. 

A similar study of the c+a chart for variability is given. 
Efficiencies from 70-80 per cent. of the c-+-a chart compared 
to the R chart are obtainable. 

Since the (c—a) and c-+a charts are not independent, 
it is suggested that they be used in conjunction corres- 
ponding to the joint use of * and R charts. 

Estimates of the mean and standard deviation from the 
gauged data are 

HK” = (00, +22)/2 — (ty +g) 0" /2 

o” = (x2 —x,)/(u, —u) 
where Pr(wSu,) = a/n and Pr(u=u.) = c/n, and u is the 
standardised normal deviate. 


(D. W. Gaylor) 


8.8 (8.7) 


Acceptance inspection by variables wien the measurements are subject to error—In English 


Ann. Inst. Statist. Math., Tokyo\(1950) 10, 107-129 (3 refs., 1 table) 


In this paper an acceptance inspection procedure is discussed 


where the observable variable is composed of two parts; 
one from the fluctuation of the characteristic (true value) 
of the sampled item and the other from the measurement 
error. ‘The measurement is supposed to be performed 
m-times on a sampled item and the measured values are 
averaged to give the variable x; for the inspection. The 
true values are supposed to follow a normal distribution 
with unknown mean p» and unknown variance o?. The 
measurement errors are supposed to be mutually independent 
and to follow one and the same normal distribution with 
zero mean and variance. It is assumed that the constants 
R and R which satisfy the relation 0<RSo/o,SRS+0 
are known. . 

The main concern of this paper is where o, is known 


and the acceptance criterion is represented as ‘“‘ Accept the - 


lot if and only if «+k,—vo,<U”’ for some fixed constants 
k, v and U where * and s? are the mean and variance of 
sample (x, 2, ..-. xv). It is further assumed that the 
distribution of s is approximated by the normal distribution 
with mean (o2+03/m)? and variance (c?+02/m)/2N. It is 
required that the acceptance criterion has the property 
that the probability of accepting the lot is greater than 
1—« when the probability for the true value to be greater 


than U is less than p,, and the probability of rejecting the 
lot is greater than 1—f when the probability for the true 
value to be greater than U is more than py. 

The paper proceeds with a statement of results for the 
case when the variance of the measurement errors is known 
and the case where it is not known. ‘The properties of the 
upper limit (R) of the ratio of the standard deviation of the 
measurement errors (o/o,) are given together with the 
determinations of the number of measurements (m) to be 
made on each item of the sample. A brief note is given 
on the properties of a test proposed by Wallis [Selected 
Techniques of Statistical Analysis (1947), McGraw-Hill, 
chapter 1] together with a numerical example. 

The paper concludes with a detailed section showing the 
various derivations used in the development of the paper. 


(H. Akaike) 
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FRAZIER, D. & WHITMAN, D. R. (Standard Oil Co. Ohio and Case Inst. Tech., Cleveland) 


8.9 (10.2) 


Quality control applications in the oil refining field—In English 
Nat. Conv. Trans., Amer. Soc. Qual. Contr. (1959) 201-212 (4 figs., 1 graph). 


The usual techniques of quality control apply to the pro- 
duction of discrete things. In the petroleum industry a 
continuously flowing material is produced. There is no 
natural unit of production and mixing can be used to average 
out fluctuations in the material. Thus, modifications of 
conventional quality control techniques are required, since 
the assumption of independence from one sample to the 
next is not valid. 

A sample is contaminated to some extent with the material 
present when the previous sample was taken. The model 
considered is 

(P;— pP:-1)/A— p) Se oe oes 

where P; is the result obtained from the ith sample, P;_, 
the result from the previous sample, p is the proportion of 
the material still present from the time of the previous 
sample, A is the average or target value, and the E;’s are 
independent errors from a normal population with zero 
mean. ‘The assumption that the E,’s are independent is 
justified if the fluctuations about A occur rapidly compared 
to the length of time between samples. If samples are 
taken at regular intervals of A, then p = exp (—RA/V) 
where RF is the flow rate and V is the effective volume of 
the mixing tank. It is suggested that R be set so that 
0:1< p<0-9. Since the quantity (P;— pP,_,)/(1—p) varies 
independently about the target value A, conventional quality 
control techniques can be applied. | 
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MICKEY, M. R. (General Analysis Corporation, Los Angeles) 


The final product is collected and sold in batches. As 
the batch size decreases, the batch rejection rate will increase. 
As the batch size increases, there is a greater chance for 
process fluctuations to average out. A graph is given from 
which the standard deviation among batches can be found 
from the variation among stream samples. Then, the 
average or target value can be set depending on the prob- 
ability required that the batch passes some specification. 

It is desirable to estimate the final batch quality from 
samples taken from the stream during production. Graphs 
are given from which the standard deviation among samples 
with a batch can be found. From the graphs, the optimum 
number of samples could be determined in order to insure 
that the batch has a given probability of satisfying specifica- 
tions. 


(D. W. Gaylor) 


8.2 (8.1) 


Some finite population unbiased ratio and regression estimators—In English 


J. Amer. Statist. Ass. (1959) 54, 594-612 (8 refs., 2 tables) 


_In this paper, a broad class of unbiased ratio and regression- 


type estimators is given which includes the well-known 
unbiased ratio estimator of Hartley and Ross. The idea 
utilised in the construction of such estimators is used to 
construct estimators of the population total when sampling 
is done with varying probabilities of selection. 

In Section 2, a broad class of unbiased ratio and regression- 
type of estimators is constructed, and in Section 3 examples 
of some useful unbiased ratio and regression-type estimators 
are given which includes Hartley and Ross estimator. The 
variance of these estimators is not considered, but in Section 4 


- unbiased non-negative estimators of variance for a sub- 


class are constructed, and Section 5 gives numerical examples. 

In Section 6, a set of unbiased estimators with non- 
negative estimators of variance is constructed, when sampling 
is done with varying probabilities of selection. ‘This over- 
comes the defect of Yates and Grundy’s estimator of variance 
which can take negative values. 


(J. N. K. Rao) 
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MILNE, A. 


(King’s College, Newcastle-upon-Tyne, England) 
The centric systematic area-sample treated as a random sample—In English 


8.6 (8.4) 


Biometrics (1959) 15, 270-297 (26 refs., 9 tables, 6 figs.) 


It is well known that systematic samples may produce 
biased estimates of the mean and variance if periodic varia- 
tions occur in the population. On the other hand, if the 
population is randomly ordered, systematic samples may 
be treated as random samples without incurring bias. 
However, little is usually known about possible periodic 
variations in actually sampled populations, and hence 
existence of bias can only be hypothesised. 

The author has made an attempt to obtain some estimates 
of bias due to the use of the centric systematic sample on 
forty-nine biological and one artificial population on which 
The centric 
systematic sample is drawn, from a population distributed 
on a plane, such that sample points lie on equi-distant 
parallel lines, and is thus well suited for sampling certain 
biological populations. 

The author uses each of these populations to draw an 
equivalent simple random sample, stratified random sample, 


and centric systematic sample, the latter being analysed 


as (i) a simple random sample and (ii) a stratified random 
sample. The resulting estimates are then compared, 
particularly with reference to the estimates obtained through 
the centric systematic samples. 

In general, the author finds that the estimates from the 
centric systematic samples compare favourably with these 
from the other samples. The existence of hypothetical 
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periodic variations in the population is discussed, particularly 
for one of the populations where the centric systematic 
sample fared relatively poorly. In this case he strongly 
suspects that the periodic variation that caused the bias 
in the centric systematic sample is due to an enumerator 
bias. ‘Thus the author’s final conclusion is that, “ with 
intelligent caution, one will not go very far wrong, if wrong . 
at all, in using CSS (centric systematic sample) as if it were 
random ”’. 


(R. J. Freund) 


8.5 (7.2) 


Some problems of household interview design for the National Health Survey—IJn English 
J. Amer. Statist. Ass. (1959) 54, 69-87 (21 refs., 9 tables) 


Evidence from earlier surveys and from a pilot survey 
conducted in Charlotte, North Carolina, provided some 
answers in making decisions about the conduct of the inter- 
view for the National Health Survey. 

Data on illness conditions from households, in which 
all adults were to be interviewed for themselves, were 
compared with those obtained from households in which 
adults found at home were to be interviewed for themselves 
and absent related household members. ‘The sampling 
and response variability are too high to permit definite 
conclusions as to possible biases in the use of a household 
respondent. From cost considerations and experience 
from previous surveys the following procedure was adopted: 
information for an adult at home at the time of the interview 
is to be obtained from the person himself; for an absent 
adult only from the person’s spouse, if he is married, or 
from a parent or adult son or daughter residing in the 
household. Information for adults not related to the head 
of the household is to be obtained from the person himself, 
or from a proxy respondent related to him as above, if one 
is living in the household. 

Particular attention was given the problem of variability 
and bias which may be associated with the interviewer. 
Estimates were constructed for the variance between inter- 
viewers and the variance within interviewers for the statistic 
“rate per person’ based on all persons. The effects are 


462 


more marked when all adults are interviewed for themselves, 
and more marked for males interviewed for themselves than 
for females interviewed for themselves. Since all inter- 
viewers were female, the difference in sex between the 
interviewer and respondent may itself be a source of non- 
sampling variance. 

The use of check lists of diseases resulted in the reporting 
of considerable numbers of medically attended and a few 
currently disabling chronic conditions. 
“optimum ”’? length of such check lists arises. Another 
related problem is the order in which the items were 
read during the survey. In the Baltimore Health Survey, 
definite order effects were found for the symptoms, but the 
sample was not large enough to detect order effects in the 
chronic condition list. "The use of probe questions may 
be a superior method of eliciting reports of chronic disease. 


Another problem considered is the recall of illness events _ 


in a specified period of time. It was found that the recall 


was better for acute illnesses than episodes of illness caused — 


by a chronic condition, and that it was better for disabling 
and for medically attended acute illnesses than for acute 
illnesses involving neither disability nor medical care. As 
a result of the Charlotte pilot survey, two weeks was adopted 
as the recall period for all illness, injuries, and medical and 
dental care visits, except that for hospital illness a recall 
period of a year was maintained. 


(B. 'T. Onate) 
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ODERFELD, J. (Math. Inst., Polish Academy of Sciences, Warsaw) 


Statistical testing for correlated characteristics—In Polish 


8.8 (6.4) 


Zastosowania Mat. (1959) 4, 255-264 (2 refs., 1 table, 2 nomograms) 


Let the two dimensional random variable (X, Y) have a 
distribution with known mean deviations, a known correla- 
tion coefficient and unknown means py and py. 

Let X be a characteristic that is easy to test but not 
very important, and Y a characteristic that is difficult to 
test but decisive as regards the usefulness of the subject- 
matter. We can then test the characteristic X in a large 
sample, or even in a very large sample, i.e. the whole lot, 
obtaining an almost exact estimate of x. The characteristic 
Y, however, can be tested only in a small sample. 

Combining these data in a suitable way and assuming 
that the distribution of (X, Y) is a two-dimensional normal 
distribution, the author describes an acceptance test which 
will verify statistical hypotheses concerning the fraction 
defective of the lot. "The power function are given and it 
is shown that the test suggested is in certain conditions more 
economical than the classical one. 

Subsequentially the author constructs another test. In 
this test the sample is ordered with respect to characteristic 
X and only one item, having the mth extreme value of X in 
the sample (m = 1, 2, 3), is tested for Y. 


7 (J. Oderfeld) 
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ROBERTS, S. W. (Bell Telephone Labs., New York City) 


8.9 (5.2) 


Early detection of relatively small shifts in a process average—In English 
Nat. Conv. Trans., Amer. Soc. Qual. Contr. (1959) 73-78 (10 refs., 8 figs.) 


Points which are averages of n items each are plotted on 
control charts based on a process average of fy) and a 
standard deviation of o’. The effectiveness of the control 
chart is summarised by a plot of &(y) versus A, where 
&(y) is the expected number of points occurring before a 
shift to 49 +8 is detected and A = V/n 8/0’. Only one-sided 
limits are considered but an approximate method is given 
for obtaining curves for two-sided limits. Curves of &(y) 
versus A are plotted for seven procedures for rejecting the 
hypothesis, 6 = 0. 


(i) Reject when one point falls above po+Lo’//n. 
Curves are presented for L = 2:0(0°5)3°5. 
(ii) Reject when k consecutive points fall above 
Uo t+L;0'/./n. Curves are presented for k = 1(1)4, 
6, 8 where the L;, are chosen so that all tests have the 
same &(y) at A=0. For small A, a large value 
of k is best and vice versa. 
Reject if one point falls above ~»+30'/./n or seven 
consecutive points fall above po. 
Reject if the moving average of k consecutive points 
exceeds x9 +30’/+/(nk). Curves are presented for 
j7 = IGN 3 tex 
Reject if z; exceeds 9 +30’ /{r/n(2—r)}, where 2; 
is a geometric moving average with 2) = 0 and 
2; = (1—r)z;1+7%;, j>0, O<rSi, where r is 


(iii) 
(iv) 


(v) 
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arbitrary. The means, «;, are assigned geometrically 
decreasing weights back to the first mean. Curves 
are given for r = 0-25, 0-5, 1-0. 

(vi) Reject when a point exceeds py +3o0'/./n or when 
the moving average of k points exceeds py +3o’/ 
/(nk). Curves are given for k = 1, 2, 4, 8. 

(vii) Reject when a point exceeds po+3o'/./n or 2; 
exceeds the limit given in V. 


For shifts greater than 20'/./n the standard control 
chart is most practical. For smaller shifts, some of the 
other types may be useful. 


(D. W. Gaylor) 
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SAVAGE, I. R. (University of Minnesota, Minneapolis) 8.8 (1.8) 
A production model and continuous sampling plan—In English 
JF. Amer. Statist. Ass, (1959) 54, 231-247 (24 refs., 3 tables) 


The author considers a production process subject to 
random incidence of flaws. The cumulative number of 
flaws 2(t) at time t since the last trouble-shooting and over- 
haul is assumed to be a Poisson process; the probability of 
producing an effective item at time t is assumed to be the 
decreasing function PR* of 2(#), O0< P, R<1. 

Let F be the expected time between inspections; define 
cycle length as the (random) time between successive trouble- 
shootings, trouble-shooting being initiated as soon as a 
defective item is found. Then profit, defined as the expected 
value of income per cycle divided by the expected value of 
cycle length, is a function of F and several economic para- 
meters. An interesting aspect of these economic para- 
meters is that both the duration and the cost of trouble- 
shooting and overhaul is made to depend on the number 
of flaws z that have accumulated since the last overhaul. 

The author shows, by example and also in general, how 
to compute the profit-maximising F, given specified economic 


parameters. 
(H. T. David) 
465 
TRYBULA, S. (Math. Inst., Polish Academy of Sciences, Wroclaw) 8.8 (4.0) 


Estimation taking into account the error of the controller—ZIn Polish 
Zastosowania Mat. (1959) 4, 249-254 (3 refs., 1 table) 


The quality controller makes errors in the course of in- 
specting items of a sample. Let us denote by p, and py, 
the probability of recognising a good item as a bad one 
and a bad item as a good one. The author considers the 
problem of estimating the percentage defective in the lot 
from which the sample of size n has been taken (with 
replacement). The most effective unbiased estimator, i.e. 
{1/(1 —p —p2)H{m/n—Du, 

where m is the number of items qualified as bad, is not 
suitable. This is mainly because for some values of m it 
assumes negative values or values greater than unity. The 
author proposes using another estimator, which is a Bayes 
solution under the assumption that the prior distribution 
of percentage defective is uniform. 

A numerical table for m = 20 illustrates the divergence 
between the two estimators. 


(S. Zubrzycki) 
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VAGHOLKAR, M. K. (Bombay Textile Research Association) 


8.8 (8.7) 


The process curve and the equivalent mixed binomial with two components—In English 


J. R. Statist. Soc. B (1959) 21, 63-66 (4 refs.) 


The author is concerned with the form of the process curve 
assumed for the mathematical analysis of inspection plans 
on a cost basis. Barnard [‘‘ Sampling inspection and 
statistical decisions,” ¥. R. Statist. Soc. B (1954) 16, 151-174] 
proposed using a two point distribution and _ this paper 
sets out in detail a method whereby this may be achieved. 
Batches of articles are inspected for acceptance, p being the 
proportion defective in the batch. If p>p, the batch is 
“bad”; if <p» the batch is acceptable. L(p) is the 
decision loss from a wrong decision [L(p,) = 0], and P(p) 
is the prior distribution of » among batches presented for 
inspection, or P(p) is the process curve. The total expected 
cost per batch inspected is to be a minimum. 
The author takes 


P{p = pi} = a, Pld = po} = a, = 1-G, 

a, is shown as equal to the integral of P(p) with respect to 
p from zero to po. ; is chosen so that L(p;) is equal to 
the integral of the product P(p)L(p) with respect to p from 
zero to Py divided by a,. pg, is chosen so that L(p.) is equal 
to the integral of the product P(p)L(p) with respect to p 
from py to 1, divided by ay. 

Examples are given and the procedure compared with 
those of Champernowne and Sittig. 


(Florence N. David) 
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VEEN, B. (N.V. Ondomocnesinstiniat “Research ”’, AKU, Arnhem) 


Optimum allocation in nested sampling—In Dutch 
Statist. Neerlandica (1959) 13, 171-186 


The solution to the problem of optimum allocation which 


is given by Marcuse [Biometrics (1949) 5, 189] is explained 


and elucidated by an example. ‘The known formulae 
obtained via this method often do not sufficiently approxi- 
mate the true (integer) optimum sample numbers. This 
paper shows how this can be avoided. 

If one has to deal with a design for a nested sampling 
procedure, the formulae given by Marcuse make it possible 
to calculate the number of observations within each stage 
of the nested design. Some conditions, however, have to 
be fulfilled. 

(i) A reliable estimation of the components of variance 

within each stage must be known, 

(ii) the costs of observation per unit within each stage 

must be known. 
(iii) the allowed total costs or the allowed variance of the 
sample mean must be specified. 


In such a case it is possible to minimise with respect to 
the variance of the sample mean, or the necessary costs by 
choosing suitable numbers of observations within each 
stage of the sampling design. 

In practice it turns out that considerable savings can 
be realised by this method. However, it appears that if 
small sample numbers, unfavourable ratio of costs and 


- variance, and designs with more than two stages are con- 


cerned, the known formulae often do not sufficiently approxi- 
mate the true optimum sample numbers. The departures 
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8.1 (8.7) 


are caused by the discrete character of the problem. Strictly 
speaking, the formulae only hold for continuous cases. In 
practice, however, it is only possible to take an integer 
number of samples. This requires rounding off the 
fractional numbers found by means of the formulae. But, 
by doing this, we leave the optimum allocation of the sample 
numbers and the problem is how to round off to get the set 
of sample numbers which give the best solution. It appeared 
that generally the true optimum cannot be reached by 
applying simple rules for rounding off to the nearest integers. 
A number of possible ways for finding the required optimum 
sample numbers are described. ‘These are based on calcula- 
tion of the results for different combinations of integer 
numbers in the ‘‘ neighbourhood ”’ of the fractional numbers 
calculated from the formulae. 

In an appendix a short theoretical consideration is given 
about the optimum relation between variances (V) and 
costs (C), especially with respect to the integer character 
of the sample numbers. Each set of sample numbers gives 
one point (V, C). An optimum point (Vo, Co) is defined 
as a point characterised in that there is no other point 
(V, C) of which both co-ordinates are at the same time 
equal to or smaller than those of the point (Vo, Cy) under 
consideration. With the aid of this definition it is possible 
to construct an ‘‘ optimum path” of points, which can be 
used to allocate the required optimum in a certain case. 


(B. Veen) 
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ANDERSON, R. L. (North Carolina State College, Raleigh) 


9.6 (6.9) 


Use of contingency tables in the analysis of consumer prefereficé studies—In English 


Biometrics (1959) 15, 582-590 (4 refs., 3 tables) 


Consider a preference study in which m consumers are 
asked to rank r varieties. In the resulting data it is desired 
to test for independence of varieties and ranks. This is 
not the usual problem of a contingency table with fixed 
border totals since, although each of the m consumers act 
independently, he has only r/! possible preference sequences. 
Every consumer is represented in each column (rank) so 
that one may make a test of varietal differences at each 
rank, using a x” statistic (denoted by Q? form ranks j5— 1, 


..., 4) based on r—1 degrees of freedom. However, since 


& 

the QF, j =1,...,7 are not independent, 0? = ¥ QF does 
j= 

not approach a x? with r(r—1) degrees of freedom. For 

an over-all test the statistic (r —1)Q?/r is proposed. 

It is shown that the proposed test statistic has an 
asymptotic x?-distribution with (r—1)? degrees of freedom. 
Reference is made to a sampling study of the distribution 
of Q? for r = 3 with n = 30, 

Four alternative methods of analysing consumer prefer- 
ence data of this kind are considered. ‘The first is a test of 
independence of the first and second of three rankings. 
The second involves construction of orthogonal single- 
degree-of-freedom contrasts. The third alternative makes 
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BOSE, R. C. & CARTER, R. L. (Univ. North Carolina, Chapel Hill, and Illinois Inst. Tech., Chicago) 


use of a likelihood procedure for the rank analysis of triple 
comparisons, while the fourth involves analysis of the 
variation among scores assigned to the varieties. 


(W. A. Glenn) 


93192) 


Complex representation in the construction of rotatable designs—IJn English 


Ann. Math. Statist. (1959) 30, 771-780 (4 refs.) 


Response surface techniques are discussed as a generalisation 


- of factorial designs, emphasising the concept of rotatability; 


the property that the information (reciprocal of the variance) 
contours are hyper-spheres centered at the origin of the 
k-dimensional sample space where k denotes the number 
of factors. For the case of two factors, it is shown that the 
necessary and sufficient condition for a configuration of 
sample points to be a rotatable arrangement of a specified 
order d, is greatly simplified if the factor space is considered 
as the complex plane. In fact the condition is that 


Ses ee 
U 


for all integers a and Bb satisfying 0<a<2d, 0Sb<a/2 
where 2, is the complex variable x,+7y,, 2, is its complex 
conjugate x,—7y, and (x,, yu) is the uth sample point. 
Necessary conditions governing the combination of rotatable 
arrangements into configurations possessing a higher order 
of rotatability are discussed. 

These conditions are also sufficient for circular rotatable 
arrangements where a rotatable arrangement formed by N 
points 2, 22, .... 2v of the complex factor plane, is called 
a circular rotatable arrangement if the N points lie on a 
circle with the origin as centre. This is done by showing 
that certain coefficients must vanish in the ‘‘ design equation ”’ 
whose roots are the (complex) values of the various sample 
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points. A method is presented by which any configuration 
of sample points (for example, some configuration fixed by 
non-statistical conditions) may be completed into a rotatable 
design of the first order by the addition of only two properly 
chosen additional sample points. 

The authors mention that one of them has, by using an 
iterative process, calculated a table of circular rotatable 
arrangements of the first order each with seven points not 
situated at the vertices of a regular heptagon, and that it is 
hoped to publish the details of the computational procedure 
and the table of designs as a separate paper. 


(J. W. Wilkinson) 


BOSE, R. C. & DRAPER, N. R. (University of North Carolina, Chapel Hill) 


9.3 (9.2) 


Second order rotatable designs in three dimensions—In English 
Ann. Math. Statist. (1959) 30, 1097-1112 (9 refs., 3 tables, 2 figs.) 


This paper is devoted entirely to the construction of second 
order rotatable designs in three dimensions, i.e. for three 
factors. By the use of new construction methods various 
infinite classes of designs have been obtained. These 
infinite classes can be classified into three ‘main types 
according to their method of construction. Ten infinite 
classes of designs of one type are tabled with the number 
of points in each design ranging from 18 to 32. Four infinite 
classes of designs of another type are tabled with the 
minimum number of points in each design ranging from 29 
to 32. The third type contains one infinite class of 16 point 
designs. A table giving a partial listing of these designs is 
presented. 

Certain particular designs can be generated by the 
methods of construction that are used. Only two particular 
designs contain fewer points than the infinite class of 16 
point designs, these having 12 and 14 points, respectively. 
The authors point out that it may be shown that all the 
second order rotatable designs in three factors previously 
known can be derived as special cases of these infinite 
classes. 


(J. W. Wilkinson) 
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CHERNOFF, H. (Stanford Tae California) 
Sequential design of experiments—IJn English 


9.7 (5.7) 


Ann. Math. Statist. (1959) 30, 755-770 (6 refs., 2 tables, 1 fig.) 


This paper presents a procedure for the sequential design 
of experiments where the problem is one of testing a hypo- 
thesis H,. It is assumed there are two possible actions, 
viz., accept H, or reject Hy, and a class of available experi- 
ments. After each observation the statistician has to take 
the decision whether to seek further observations. If he 
decides to continue, he must select one of the available 
experiments. If he decides to stop he must select one of 
the two terminal actions. 

The approach used is motivated by Wald’s sequential 
probability ratio test for a simple hypothesis compared with 
a simple alternative when only one experiment is available. 
The risks corresponding to such a strategy are known to be 
functions of the probabilities of incorrect decision, cost of 
experimentation and the average number of observations. 
The average number of observations is a function of the 
Kullback-Liebler information numbers given by 


Ty = | log [fol X)/fulX)1fo(X) dx, 
Ty = | log LAX)/fol X) Ail Xax, 


fo and f, being the densities of the observed random variable 
under the hypothesis and alternative respectively. 

In the more general case where there are several experi- 
ments available at each observation time, and where the 
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hypotheses are composite, the choice of experiment at each 
stage is based on these information numbers and the 
maximum likelihood estimate 0, of 0. If 0, is consistent 
with Ho, then that experiment FE is chosen which maximises 
I,(£). If @, is consistent with H,, then the experiment FE, 
chosen is that one which maximises J,(£). ‘The stopping 
rule used is analogous to that of the usual sequential prob- 
ability ratio test. 

This procedure is shown to be ‘‘ asymptotically optimal ” 
as the cost of sampling tends to zero, which in turn implies 
that sample sizes will be large. As the author points out, 
the asymptotic optimality may not be relevant in the early 
stages of experimentation—especially if the cost of sampling 
is high. The procedure here is based essentially on assuming 
the maximum likelihood estimate at any stage is close to 
reality and it is efficient when this is so. 


(D. G. Chapman) 
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GIRI, N. C. (Indian Council of Agric. Res., New Delhi) 


On row balance in partially balanced incomplete block designs—In English 


9.1 (7.4) 


J. Indian Soc. Agric. Statist. (1957) 9, 168-178 (6 refs.) 


In this paper, it is shown that the following two classes c, 


and c, of partially balanced incomplete block designs with 
parameters: 


Cy: Uv = P(p— 1); b = p*, 1p; 
ee it A P—2), nm, =p —1 
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GRAYBILL, F. A. & WEEKS, D. L. (Oklahoma State University) 


can be arranged in rows such that each treatment is replicated 
at least m times and at most (m+1) times in each row and 
the arrangement of the (m+1)th replicate of the treatments 
in the rows is such that these form a partially balanced 
incomplete block design with parameters 
OU, 0 = Ry PRA Ags n; = 2 
age t ° ° 
Dig Py, hOT 1 ti 

The general intra- and inter-block analysis for these 

designs eliminating the row effects is also presented. 


(J. Roy) 


9.1 (7.3) 


Combining inter-block and intra-block information in balanced incomplete blocks—In English 


Ann. Math. Statist. (1959) 30, 799-805 (5 refs., 1 table) 


For balanced incomplete block designs associated with the 
Eisenhart Model III (blocks random, error random), two 
independent estimates of treatment differences, as well as 
a combined estimate, were given by Yates. This paper 
expresses the combined estimate in terms of a set of minimal 
sufficient statistics and shows that a slightly altered form 
of the estimate is unbiased. 
The mathematical model is written as 


Vigm — a+fitr, + iam, 


Perce thao g — 102 4..., t3 and m= 0, 1,....; 
 Mq. Here t is the number of treatments; 6 is the number 
of blocks; 1, is 1 if the gth treatment is in block 7 and 0 
elsewhere; and n = bk = tr, where k is the number of 
cells per block and 7 the number of replicates. A trans- 
formation is made so that the parameters to be estimated 
are a2 (the common variance of im), or (the variance of 
Enewb) ae eo tS Phage and + (a vector whose t—1 
elements are contrasts of the To). The set of minimal 
statistics is given at the end of Section 4 and consists of 
(t—1) vectors U = (u;) and X = (x;) and scalars y..., 
In Section 5 it is pointed out that u; is an 
unbiased (intra-block) estimate of 7;; x; is an unbiased 
estimate (inter-block) of 7;; S?/f is an unbiased estimate 
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of o%, where f = n—b—t+1; S*?/(6—t) is an unbiased 
estimate of o?+ko35; and y... is an unbiased estimate of p. 

Yates’s combined estimate of 7; has two forms, one if 
6% >0, another if 63 <0. In Section 7, the first of these 
estimates is written in terms of U, X, S** and S?. In 
Section 8 it is shown that if this first estimate is used, 
regardless of 63, then it provides an unbiased estimate of 
the 7;. 


(R. Hooke) 


GRIDGEMAN, N. T. 
Pair comparisons, with and without ties—In English 
Biometrics (1959) 15, 382-388 (9 refs., 3 tables) 


A probabilistic model for paired comparisons with the 
possible admission of ties is treated in this paper. ‘The 


model’s five parameters are the probabilities of the following 
events: 


a. Discrimination of the difference between items A 
and B. 

b. Deliberate choice of item A, given discrimination; 

c. Deliberate choice of item B, given discrimination; 

d. Arbitrary choice of A or B, given discrimination; 

e. Indiscriminate guessing. 


The probabilities of selecting A or B, or of reaching a 
te, are expressed in terms of these parameters. Admitting 
ties, the probabilities associated with discrimination tests 
between two stimuli of different intensity are functions of 
the probabilities of events (a) and (e); excluding ties, only 
the former probability is a parameter. Preference trials 
with obviously different items and possible ties involve the 
probabilities of (6), (c) and (d), while the model without 
ties depends only upon the probabilities of (b) and (c). 

It is argued that the admission of ties in paired com- 
parisons (followed by their omission from the analysis) 
increases the power of the test of the null hypothesis that 
the probability of discrimination ((a) above) between items 
A and B is zero. However, exceptions may occur for small 
numbers of repetitions, and tables are given to illustrate 
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(Div. of Applied Biology, Nat. Res. Council, Canada) 


9.5 (9.6) 


this behaviour of the power function. An empirical study 
of the effect of admitting ties is presented with data from 
a taste-testing experiment of known properties, with the 
conclusion that the presence of ties depresses the estimate 
of the probability of (a). 


(D. F. Morrison) 


9.2 (11.5) 


A general analysis of variance scheme applicable to a computer with a very large memory—IJn English 


¥. Ass. Comp. Mach. (1959) 6, 469-475 (1 ref.) 


Assume that the observations obtained in an r-factor 
factorial experiment, in which the ith factor has n; levels, 
have been stored in a computer in sequential order with 
the first number in cell 0000 and the last in cell n,,n3...n;. 
The complete analysis of variance is based on linear com- 
binations of the squares of sums of the observations. ‘This 
paper gives a method for computing these sums on an 
electronic computer. 

Since the observations are in order, the sums for the 
first factor can be obtained by summing the first 7, values 
for the first sum, the next 2 values for the second sum and 
soon. Similar sums for the second factor could be obtained 
by rearranging the order so that the observations were in 
order by the second factor. In an electronic computer this 
procedure is wasteful of computing time and other methods 
may be more efficient. 

The method developed by the author consists of com- 
puting the locations of all the observations which should be 
added together to obtain a certain sum. ‘The basis result 
of this paper is a formula by which the locations of all the 
observations which should be added into a certain sum can 
be obtained. An example is given to illustrate the use of 
this formula. 


(D. Teichroew) 
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HARTLEY, H. O. (Iowa State University, Ames) 


9.3 (4.3) 


Smallest composite designs for quadratic response surfaces—In English 


Biometrics (1959) 15, 611-624 (4 refs., 3 tables) 


Composite designs were introduced by Box and Wilson 
[F. R. Statist. Soc. B (1951) 13, 1]. These designs add 
certain additional points to the 2” factorial design (sometimes 
called “star designs’’). This paper gives a simple answer 
to the problem of determining how many and what co- 
efficients of the quadratic response surface of the form 


n n n 
Ye hota pie, Bie + Xt Begg; 

i=1 4=1 Wj) 
can be estimated. However, composite designs are examined 
only with regard to their merits as a single (one-piece) 
experiment which is supposed to supply directly estimates 
of the parameters in the above-mentioned response surface. 
The designs derived are based upon the following 

theorem: 


(a) In any composite design, in which no main effect 
is used as a defining contrast of the (1/2*)2” fractional 
replicate, it is always possible to estimate the follow- 
ing coefficients of the quadratic response surface 
given above: all linear coefficients (6,), all quadratic 
coefficients (6;;), and the constant (69); one of the 
product coefficients (6;;) selected from each of the 
alias-sets. 

(b) It is not possible to estimate more than one of the 
product coefficients (5,;) from each alias-set. 

The implication of this theorem is that the largest 
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HARTLEY, H. O. (Iowa State University, Ames) 


Smallest composite designs for quadratic response surfaces—In English 


Biometrics (1959) 15, 611-624 (4 refs., 3 tables) 


The last part of the paper deals with methods of obtaining 
the variance of the estimated response surface, for both to 
the above cases. 

The author points out that designs usually recommended 
as fractional replicates per se are not necessarily the best 
when used in composite designs. Alternatives permitting 
the estimation of the largest possible number of coefficients 

in the second-order surface are developed and their efficiencies 
discussed. 


(J. E. White) 
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number of coefficients of that response-surface can be 
estimated if each alias-set of the fractional replicate contains 
at least one two-factor interaction x,;x; permitting the selec- 
tion of one coefficient b;; from each of the alias-sets. 

Examples of such designs are then discussed. ‘The 
design matrix, X, for the general case is also given. In 
order to form XX’ the coefficients bo, b;, b;;, b;; must be 
arranged into groups as follows: 


A. The even order group: 0o, b;; 1 = 1, 2, ..., n. 

B. One alias group for any of those of the 2”—1 alias- 
sets which contain either at least one main effect or 
have a two-factor interaction x,;x; represented in the 
p selected terms. 


Examples of both groups are giver in considerable detail. 

The variances and covariances of the estimated co- 
efficients are then considered. Since any two coefficients 
in two different alias groups are uncorrelated the author 
confines himself to a single alias group and again dis- 
tinguishes the two cases: 


(i) A product term x,;x; has been selected from the alias 
group. 
(ii) No product term has been selected from the alias 
group. 
Expressions for variances and covariances are obtained” 
for both of the above cases. Expressions for certain relative 
efficiencies of alternative designs are also worked out. 


9.3 (4.3) 


continued 
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MORLEY JONES, R. (Dept. of Genetics, Univ. of Birmingham) 


On a property of incomplete blocks—JIn English 
J. R. Statist. Soc. B (1959) 21, 172-179 (6 refs.) 


The author shows that balanced incomplete block designs 
have the property that any two orthogonal treatment con- 
trasts have orthogonal inter- and intra-block components. 
Matrices D and A are defined, the elements being 1 or 0, 
according as the jth observation is in respectively the ith 
block or ith treatment or not. The incidence matrix 
specifying the design of the experiment is given as the 


product of A and the transpose of D. The elements of this - 


matrix specify the number of times that the ith treatment 
appears in the jth block. Diagonal matrices are formed 
from D and A times their transposes. The diagonals of 
these matrices give respectively the block sizes and the 
number of replicates of the treatments. 

A block contrast sometimes called an inter-block con- 
trast, and a treatment contrast are defined. A contrast 
which is orthgonal to all block contrasts is called an intra- 
block contrast. 

In section 3 a condition for resolving a treatment contrast 
into inter- and intra- block components is given. It is 
then shown that the necessary and sufficient condition that 
any two orthogonal treatment contrasts have both inter- 
and intra-block components orthogonal is that a certain 
suffixed summation over blocks involving elements of the 
design matrix, block sizes and number of replications per 
treatment should be independent of any two treatments 
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9.1 (0.6) 


chosen. The necessary and sufficient condition for a treat- 
ment contrast to be resolvable into inter- and intra-block 
components, each estimating the same treatment contrast 
is that it should be a latent vector of a certain matrix M. 
The elements of this matrix are suffixed summations depend- 
ing on elements of the incidence matrix, block sizes, and 
numbers of replicates of each treatment. 

The latent roots of M are investigated and an expression 
is obtained for the efficiency factor of a design. A proof is 
given of Yates’ theorem, stating that the sum of the relative 
losses of information on a set of orthogonal contrasts in a 
balanced arrangement is equal to the number of degrees of 
freedom confounded in each replicate. 

‘Two examples of the use of the matrix M in investigating 
the properties of a design are given, a 3 by 2 factorial 
experiment, and a balanced incomplete block. 


(T. I. McGlaughlin) 


9.4 (7.2) 


Note on optimum group size for progeny tests—In English 


Biometrics (1959) 15, 513-517 (2 refs.) 


Robertson [Biometrics (1957) 13, 442-450] developed a 
solution for optimum number of sires to test in a progeny 
testing plan. His solution assumed a fixed total number of 
progeny to be tested in each generation, and one tested 
offspring from each dam. Rendel [Biometrics (1959) 15, 
376-381] extended the solution by Robertson to half-sib 
selection. The criterion for optimum number was, in 
each instance, the expected genetic superiority of the selected 
sires or progeny groups. 

The author has considered the case in which each sire 
has several mates and each mate has several offspring. 
Equal numbers of mates per sire and equal numbers of 
offspring per dam (as well as a fixed total number of off- 
spring) are assumed. The expected genetic superiority of 
the selected sires is expressed as a function of (a) the fraction 
selected, (b) the regression of breeding value of the tested 
sires on their progeny means, and (c) the standard deviation 
of the sire progeny means. ‘The resulting formula for 
genetic superiority is recast into terms of (a) the fraction 
selected, (b) numbers of mates per sire and offspring per 
dam, and (c) variance components among full sibs, dams, 
and sires. 

Two cases are examined, namely the optimum number 
of sires to test for fixed values of (a) the number of offspring 
per dam and (b) the number of mates per sire. Using 
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values of 1000 total offspring, 5 sires selected, and herit- 
ability equal to 0:1, expected genetic superiority (and hence 
expected genetic gain) is maximum with only one offspring 
per dam. However, the loss is only 5 per cent. when 
offspring per dam is increased to 10. Similarly, the optimum 
number of mates per sire is 25, but the loss is only about 
3 per cent. with 5 mates per sire. [The author suggests 
that economic values and costs of the testing programme 
should be considered, in a practical solution. 


(N. R. Thompson) 
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QUENOUILLE, M. H. (London School of Economics) 
Experiments with mixtures—In English 
J. R. Statist. Soc. B (1959) 21, 201-202 (2 refs.) 


Experiments with mixtures are factorial experiments where 
the total level of all factors may not exceed a certain level 
(and is usually restricted to this level). 

Scheffé considered the interaction between two factors 
_A and B as a quadratic regression coefficient and Quenouille, 
considering equal proportions of A and B together with a 
non-interacting factor C, notices that Scheffé’s model leads 
to an AB interaction which is not proportional to the total 
level of A plus B; he says that this is ‘‘ not completely 
additive’. He notices a similar effect when all interactions 
are present in a 2X2xX2 factorial experiment with the 
mixture restraint and the factors present in equal pro- 
portions. ‘The author remarks that problems of design, 
analysis and interpretation differ as between Scheffé’s model 
and a model where there is complete additivity. 


(D. E. Barton) 
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RENDEL, J. M. (Sydney University, Australia) 


Optimum group size in half-sib family selection—In English 
Biometrics (1959) 15, 376-381 (2 refs., 2 tables, 1 fig.) 


The objective of this paper is to arrive at selection pro- 
cedures which maximise the genetic superiority of selected 
half-sib groups. The author states that Robertson’s 
[Biometrics (1957) 13, 442-450] expression having to do 
with selecting by progeny testing may be applied to selecting 
half-sib groups for practical purposes, but that this is not 
strictly true. A formula for the correlation between the 
breeding value of a half-sibship and its mean score is 
_ presented, followed by a formula for genetic advance when 
selecting a half-sibship. Similar formulae are presented 
for the progeny test situation. Tabular values of the genetic 
superiority of selected half-sibships and of selected sires 
under varying conditions with N animals tested in all, n 
_ for each sire, and h? the fraction of variance that is genetic, 
are shown. Comparisons among the genetic superiorities 
for any fixed number tested and heritability value enable 
the reader to choose the optimum group size that should 
be used. The principal field of application of the results 
of this study is poultry breeding. 


(J. A. Gaines) 
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SCHUMANN, D. E. W. & BRADLEY, R. A. (Univ. of Stellenbosch, S.A. & Viginia Polytechnic Inst.) 
The comparison of the sensitivities of similar experiments: 
Model II of the analysis of variance—In English 


Biometrics (1959) 15, 405-416 (4 refs., 5 tables) 


This paper is concerned with the analysis of data which 
arise from necessarily subjective appraisals of an attribute 
of sample units which may be ill-defined, such as quality, 
performance, preference, etc. ‘The problem becomes one 
of choosing the scale of ‘‘ measurement’ (or experimental 
technique) which is most likely to disclose treatment differ- 
ences when they exist, and the authors propose a test 
whereby the sensitivities of analyses on two sets of observa- 
tions for the same set of treatments may be compared, the 
two sets of observations being referred to two different 
scales of appraisal, and the two parallel experiments being 
distinct and independent in probability. The development 
is limited to such pairs of experiments which are similar, 
that is, which allow the same degrees of freedom for their 
respective treatment mean-squares and the same degrees of 
freedom for their error mean-squares. 

A brief review of the authors’ two earlier papers is 
presented and in the present paper the test is extended to 
the comparison of sensitivities of experiments when Model II 
applies, and the previous tables are extended to give 95:0, 
97-5, 99-0 and 99-5 percentage points of’ the ratio of two 
central variance ratios with the same degrees of freedom, 
2 to 21, 30, 40, 60, 120 for the numerators and 2 (1 to 10, 
15, 20, co) for the denominators. 

For Model II analysis of variance let ob, be the variance 
of treatment effects and let a; be the residual or error variance 
for the jth experiment, 7 = 1, 2. Then the null hypothesis 
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SHRIKHANDE, S. S. (University of North Carolina, Chapel Hill) 
The uniqueness of the L» association scheme—In English 


Ann. Math. Statist. (1959) 30, 781-798 (10 refs.) 


The author considers an incomplete block design for v 

treatments in b blocks, each containing k (less than wv) 

different treatments, such that each treatment occurs the 

same number of times in the design and such that any two 
_ different treatments occur together in either A, or A, blocks 
where A, and A, are different non-negative integers. It is 

shown that the “‘ association scheme ”’ of the design is of a 
_ particular type known as L, association scheme, if 


(()) Ones GH SSA Ore Sea: 

(ii) each treatment occurs with exactly (2s—2) treat- 
ments in A, blocks and with the remaining in ); 
blocks 

(iii) with respect to any two treatments which occur 
together in A, blocks, there are exactly (s—2) other 
treatments, each one of which occurs with these 
two treatments in 4, blocks 

(iv) with respect to any two treatments which occur 
together in A, blocks, there are exactly two other 
treatments which occur with each of these two 
treatments in A, blocks. 


The L, association scheme is explained by Bose and 
Shimamoto [¥. Amer. Statist. Ass. (1952) 47, 151-184]. 
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9.9 (7.2) 


for each experiment is that oD, = 0, and when real treatment 
differences exist, the variance-ratios of treatment mean 
square to error mean square are distributed asF,,(1 +hjo5,/ o*) 
with 2a; and 2b, degrees of freedom (a; and b; are integers 
or half-integers) where k; is the number of observations 
for each treatment. The measure of sensitivity proposed 
is the ratio oG,/o%, and letting y; = hyopjlo%, the null hypo- 
thesis for the test of significance on sensitivities is that 
¥1/Ry = y2/ke, and the test statistic is the ratio of the two 
variance-ratios. For similar experiments in general, approxi- 
mate percentage points of the test statistic may be found 
from the given tables, and in the case of identical experi- 
ments exact percentage points are those in the tables. Also 
outlined is the procedure for finding confidence limits for 
the ratio (1+y,)/(1 +2). 

For the case of identical experiments a numerical example 
is presented of the procedures for making the test of 
significance and for finding confidence limits. "The observa- 
tions are on the quality of sheets of dried veneer, with scales 
varying respectively from 0 to 50 and 0 to 8 with O in each 
case indicating a sheet free of degrade. "The analyses are 
two-way classifications (sheets and observers) with inter- 
action, where the component of interest is the variance for 
sheets. ‘The result of the comparison of sensitivities of the 
two experiments is non-significant, and 95 per cent. con- 
fidence limits are computed for the above-mentioned ratio. 


(C. C. Thigpen) 
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It has also been shown that for the case s = 4, besides 
the L, association scheme defined above there is one more 
type of association scheme possible. An actual example 
corresponding to this other association scheme is also given. 

Necessary conditions for the existence of a design with 
the L, association scheme and v = 8B are given. 


(S. Shrikhande) 
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WEARDEN, S. (Kansas State University) 


9.4 (7.2) 


The use of the power function to determine an adequate number of progeny per sire 
in a genetic experiment involving half-sibs—Jn English 


Biometrics (1959) 15, 417-423 (6 refs., 2 tables) 


The author is particularly concerned with genetic experi- 
ments which involve large animals. Since the use of large 
animals in such experiments is expensive and time-consuming 
it is of utmost importance that the experiment accomplishes 
its desired aim. ‘To do this the experimenter must know 
when he has an adequate number of observations per genetic 
group before he undertakes a genetic study concerning an 
estimate of heritability large enough to be of practical 
importance. 

A common genetic group used in studies of large animals 
consists of paternal half-sibs. Such is the case in most 
progeny tests, and one of the most common methods of 
estimating heritability is a function of the paternal half-sib 
correlation. ‘The author takes advantage of the fact that 
the test for this correlation (and hence heritability) can be 
put into the form of an analysis of variance for a completely 
randomised design under Model II. Using the easily 
determined power of this test, the author describes a pro- 
cedure to determine the number of observations needed 
per group (of half-sibs) in order, with given power, to 
detect differences within groups. The formula stated 
requires an iterative solution that is simple to calculate 
and rapidly convergent. [wo tables give the calculated 
numbers of progeny per sire for varying numbers of sires 
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and different size estimates of heritability according to the 
probabilities of the Type I and Type II errors. 

An example is presented that illustrates how an experi- 
menter might actually apply the procedure in an experiment 
where he wished to estimate the additive genetic variance 
or conduct a progeny test with respect to a certain trait. 


(R. J. Taylor) 


AKAIKE, H. (Institute of Statistical Mathematics, Tokyo) 
On the statistical control of the gap process—In English 


10.2 (8.9) 


Ann. Inst. Statist. Math., Tokyo (1959) 10, 233-259 (11 refs., 9 figs.) 


The gap process is defined as a strictly stationary discrete- 
time-parameter process {X,(w); —o<n<-+c} where 
X,,(@) = 1 and 0 with probability one and the gaps between 
successive 1’s of the process are mutually independent and 
follow one and the same gap distribution {p,; v = 1, 2, ...} 
with L = SJ vP,< oo. 

This definition was introduced by the author in his 
previous paper [Ann. Inst. Statist. Math., Tokyo (1956) 8, 
87-94] as a model which represents the sequence of time 
points of renewals on the stationary flow. The variance- 
time curye of the process is defined as {(V(h, h); h = 1, 

h 


2, ...} where V(h) represents the variance of 2 X;,, (the 
V— 1 


number of 1’s in a time interval of length h). 
Then V(f) is represented as V(h) = P{2R(h) —h}—h?P? 
h-1 


where P= 1/L, R(h) = R(h—1)+ 2 P, with R11) =1 
v=0 


and P,= 2 p,P,-, with P) =1. In the present paper 


he 
the asymptotic distribution of an estimate V(h) of V(h) is 
given where V/(h) is defined by replacing the p,’s in the 
definition of V(h) by #,’s, the observed relative frequencies 
of the gaps of length v. By using the idea of the differ- 
entiable statistical function introduced by von Mises [Ann. 
Math. Statist. (1941) 12, 263-267] the author obtains, with 
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BALAKRISHNAN, A. V. (University of California, Los Angeles) 


On a characterisation of covariances—In English 
Ann. Math. Statist. (1959) 30, 670-675 (4 refs.) 


Let x(t) be a possibly-complex-valued stochastic process 
(random function). x(t) is said. to be of second order if 
the expected value of [x(t)]? exists. In this case the co- 
variance function F(s, t) of x(t) also exists and is a non- 
negative definite Hermitian function. Let m(s) be a complex 
valued function, defined for —o<s<+o. It is then 
only a trivial matter to show that F(s, t)+m(s)m(t), where 
the horizontal bar denotes the complex conjugate, is also a 
covariance function. ‘This, however, is not true if one 
considers instead of F(s, t)-++m(s)m(t) the function F(s, t)— 
m(s)m(t). 

The author assumes that the process is stationary of the 
second order; this means that F(s, t) = R(s—t) depends 
only on the difference s—t. He gives necessary and sufficient 
conditions which assure that R(s, t) = R(s—t)—m(s)m(t) is 
a covariance function. One of these conditions states that 
m(t) shall be the Fourier transform of a function p of bounded 
variation. The second condition involves the spectral 
distribution G corresponding to R(t) and requires that the 
L,(dG)-norm of the Radon-Nikodym derivative du/dG be 
less or equal to unity. nih 

The author applies his result to the problem of estimating 
the mean value of a stochastic process. 


(E. Lukacs) 
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the aid of the generating function method, the result that 


(o/6) 
under the condition 2 v®p,< +0/N(V(h)—V(h)) tends 
v=1 
asymptotically to the normal distribution with zero-mean 
and variance D? : N being the number of the independent 
observations of the gaps used to define the ,’s.. 'This 
asymptotic result is checked. by Monte Carlo methods for 
various types of {p,} and finite N’s. The approximation is 
fairly good, even in some cases for N = 250. * 

The results of the Monte Carlo verification and the 
formal analysis of the opening sections of the paper are 
used to construct a statistical control system for the gap 
process. ‘The model is taken from a silk-reeling process 
and some numerical examples are given at the end of the 
paper. 

The statistical control system is seen to centre upon the 
appropriate choice of the length of the intervals of observa- 
tion (h) and the mean value of v,(h), certain rules are given 
and a summary procedure for setting up a statistical control 
system for a gap process. A warning is given in connection 
with a form of systematic bias that can be introduced by 
using the gap process with discrete time parameter as 
examined in this paper, to approximate a continuous time 
parameter process. 


(H. Motoo) 


10.7 (10.2) 
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CANSADO, E. 


Stochastic processes are first defined as a special case of 
random functions. Starting with the simple concept of the 
random variable, the case is considered of circular or angular 
distributions, and the parameters of the uniform circular 
distribution are given. 

In order to obtain circular distributions, a linear dis- 
tribution is ‘‘ wrapped”’ around a circle in a way similar 
to methods previously used by Lévy, Marcinkiewicz and 
-Wintner for the case of a normal or Cauchy’s distribution 
(normal ‘“ wrapped-up”’ distribution, angular case of 
Cauchy’s distribution). The procedure for the “ wrapping- 
up” is then graphically represented: formulae for the 
distribution function and the density function of the circular 
variate are also given. ; 

Another interpretation of the “‘ wrapped-up ”’ distribu- 
tion is made, imagining a circle which rolls without sliding 
on a straight line taking in each point of contact the mass 
of the corresponding point in the line. (Both the origins 
of the circle and its tangent are supposed to coincide.) 
‘The transformation of the linear uni-dimensional distribution 
into the circular bi-dimensional distribution is given in 
terms of the random variable. 

The characteristic function of a real uni-dimensional 
random variable is defined in the usual way, later being 
also considered as the mean of a stochastic process, according 
to the definition. In order to analyse the process, its 
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PIRESCU. D. (Bucharest. University) 


(Inter-American Training Centre, Santiago) 
The characteristic function as the mean of a stochastic process—In Spanish 


Estadistica Espafiola (1958), 1, 19-31 (5 refs., 5 figs.) 


10.0 (2.9) 


realisations or trajectories are examined as well as the 
probability distributions corresponding to different values 
of t. It is seen that to each particular value of the random 
variable there is a corresponding trajectory in the three- 
dimensional space which is the Cartesian product of the 
complex plane (y+7z) and the straight line (#). This 
trajectory is an ordinary helix which forms a constant angle 
with the generators of its containing cylinder. All the 
trajectories are symmetric with respect to the axis y (real 
axis of the complex plane) and therefore have Hermitian 
symmetry. On the other hand, for each particular value 
of t, the random variable e* has a bi-dimensional distribution 
in the complex plane, and this distribution is a ‘‘ wrapped ”’ 
circular one. For t = 1 the circular distribution is obtained 
by “‘ wrapping ’”’ the linear distribution round itself. For 
t = 0 and any distribution of €, t has a singular distribution, 
with all the mass (or probability) concentrated at the origin. 
The corresponding circular “‘ wrapped-up”’ distribution 
has all its mass concentrated in a point. 

The linear distribution corresponding to —té is obtained 
by symmetry from the distribution of té. Therefore the 
“wrapped ” distribution obtained from the linear distribu- 
tion of —¢é is the circular distribution obtained by symmetry 
with respect to the real axis of the complex plane from the 
circular distribution corresponding to ¢é. 


(F. Azorin) 


10.2 (4.8) 


Estimation functions for the inverse transition probabilities of a simple stationary 
Markoff chain with a finite set of states—In Rumanian 
Analele Univ. “‘ Parhon’’ Seria St. Naturii (1959) 21, 15-23 (5 refs.) 


This is the third paper in a sequence by the author dealing 
with estimation properties of a particular kind of Markoff 
chain: it is assumed that the stochastic matrix of this type 
of Markoff chain is not decomposable. 

The first paper [Analele Univ. “‘ Parhon”’ (1959) 18, 
9-18] showed that the estimation functions corresponding 
to a subset of unknown transition probabilities belonging 
to a fixed row are asymptotically Gaussian. ‘These functions 
become asymptotically efficient only in a case which was 
studied by Mihoc [Bull. Math. Soc. Sci. Math.-Phys. R.P.R. 
(1951) 1, 449-456]. This case studied by Mihoc was where 
the particular subset under consideration coincides with the 
set of all transition probabilities of the fixed row. 

In the second paper by the author [Analele Univ. 
“‘ Parhon”’ (1958) 20, 37-47] the results were extended to 
the case of an arbitrary subset of unknown transition 
probabilities. In this case, asymptotic efficiency is reached 
when the subset coincides with the set of all transition 
probabilities forming the stochastic matrix of the given 
Markoff chain. 
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In the third and present paper, analogous results are 
shown to hold true for an arbitrary subset of unknown 
inverse transition probabilities. ‘The condition for this is 
that the characteristic equation has a simple and unique 
root of unit absolute value. 


(R. Theodorescu) 
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_ secutive steps is represented by Pj. 


FUCHS, A. (Strasbourg University, France) 
A problem of attained times—In French 
Publ. Inst. Statist. Paris (1958) 7, 161-166 (3 refs.) 


This paper considers a time homogenous discrete Markoff 
chain with a finite number (r) of states. The probability 
that the system passes from state (7) to state (j) in n con- 


variable 7;(G) is independent of m, when (G) is any group 
of states. [The random variable T;(G) is termed “‘ the 
attained time’ of the group of states (G) on leaving state 
(7). ‘The concept is particularly interesting when the group 
of states (G) is a final, or end group: the case where the 
group is reduced to a single element (j) is treated in detail. 

The probability law of the random variable T;; is given; 
T;; being the attained time to reach state (/) from state (2). 
The probability that having left state (7) the system arrives 
at state (j) for the first time after m consecutive steps is 
denoted by O%. After stating a theorem relating to T;;, the 
author considers the expectation of T;;, including the case 
where (7) is an absorbing state. 

The paper concludes with an urn example consisting of 
random drawings with replacement from a set of balls of 7 
different colours. 


(W. R. Buckland) 
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GABRIEL, K. R. (Hebrew Univ. Jerusalem, and N. Carolina) 


It is shown that the | 


10.1 (1.3) 


10.1 (1.3) 


The distribution of the number of successes in a sequence of dependent trials—IJn English 


Biometrika (1959) 46, 454-460 (5 refs., 3 tables) 


This paper considers a simple Markoff chain of n+1 events 
(either successes or failures) so that the probabilities p, and 
py (being the probabilities that a success followed by 
respectively a success and a failure) are the only other para- 
meters. The distribution of the number S of successes is 
studied with particular reference to the adequacy (or other- 
wise) of the normal limit for approximation when 7 is small 


[see Feller, Introduction to Probability Theory and its Applica- 


tions 1 (1947), New York, Wiley]. 

The probability distribution function of S is obtained 
by summing over the joint probability distribution function 
of S and the number of runs (of success or failure) as given 
by David (1947). Lengthy expressions for the first four 
factorial moments of S are obtained (in terms of n, d = pi —Po 


_ P= p,/(1—d) and R, the possibility of the initial event 


being a success) by means of the method of characteristic 
random variables. ‘The central moments under various 
simplifications are then given (for R = P and neglecting 
terms in d”). The latter case gives Markoff’s values for the 
first two moments and simple expressions for the cumulant 
ratios. 

Tables of exact and approximate values are given for 
P = 0-6 and 0-8, n = 7 and 30 and d = 0(0-2) 0-8. When 
d is equal to zero the distribution is binomial: as d increases 
so do the variance and measures of skewness and kurtosis. 
The closeness of the approximations appear to the author 
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to depend largely upon d being small and nm large. An 
example shows that at least in one case the normal approxi- 
mation using the large sample variance estimate is very 
close to the discrete exact distribution. A final comment 
made in the paper is that the rather tedious computation 
of the exact probability distribution is facilitated by tables 
of the logarithms of binomial coefficients [see Hald, Statistical 
Tables and Formulas: Table 14 (1952) New York, Wiley]. 


(D. E. Barton) 
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GHOSAL, A. (C.S.I.R., Jealgora, Bihar, India) 


10.4 (1.1) 


On the continuous analogue of Holdaway’s problem for the finite dam—In English 


Aust. #7. Appl. Sci. (1959) 10, 365-370 (4 refs.) 


This paper applies Prabhu’s method [Quart. ¥. Math. 
(1958) 9, 183-188] of solving the integral equation for the 
stationary distribution function of a dam content (which 
arises in Moran’s theory of dams) to a more complicated 
model suggested by Holdaway in which inputs have a 
negative exponential distribution [Moran, Aust. ¥. Appl. Sci. 
(1955) 6, 117-130). 

In Holdaway’s scheme, the content Z; = 0, 1, 2, ...k, 
of the dam at discrete times t = 0, 1, 2, ... is given by the 
equation 
Min{Z,+X;, k+m}—m for Z,+X,2l+m 
Min{Z,+X;, 1+d}—Min{Z,+X,, d} 

for Z,+X,;<l+m 


where 1<k, d<m<k; the {X;} are independent discrete 
inputs which flow into the dam in the intervals (t, t+1) 
and have identical probability distributions {p,} (¢ = 0, 1, 
Drs) 

The processes {Z;} and {Z,+X;,} are Markoff chains, 
and equations for the stationary probabilities R; =Pr 
{Z,+X, = 1} @ = 0, 1, 2, ...) in this model are given. 

When X; and Z; take on continuous values, the distribu- 
tion function of the inputs in the intervals (t, t-+1) being 


a 
Pr{X,<x} = G(x) = | ‘g(ddt, with t = 0, 1, 2, ... 


Lisa ci 


remain- 


0 
ing discrete, the analogue of the previous process gives rise 
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GILBERT, E. J. (Sandia Corporation, Albuquerque) 


to the following integral equations for the distribution 
function: H(y) = Pr{Z,+X,<y} of Z,+X;: 


H,(y) for y<l, 
Ay) = [26) 1416 for 1Sy<hk, 
Ai, (y) +H2(y)+G(y—k) for y=k 
Min (d+y, d+1) 
where H,(y) = | A(t)g(y+d—t)dt, and 
d 


Min (m+y, k+m) 
Ay) = | HA(t)g(y +m —1)dt. 
l+m 

For inputs X, having a negative exponential density 
function g(x) = p/exp (ux), the author makes use of the 
transformation 4(y) = 1/exp (uy){1—H(k+m—y)} to solve 
these integral equations. The stationary distributions of 
Z,+X;, and of the dam content Z; are obtained explicitly. 


(J. Gani) 


10.2 (10.1) 


On the identifiability problem for functions of finite Markoff chains—In English 


Ann. Math. Statist. (1959) 30, 688-697 (3 refs.) 


Suppose a stochastic process is known to be a stationary 
irreducible Markoff chain with a finite number of states. 
Associated with such a Markoff chain is a matrix of transition 
probabilities; that is to say, the probabilities that the process 
passes from state 7 to state j, for all ij. For some reason 
the states of the process cannot be directly observed, but 
the states of the process are divided into groups, such that 
it is possible to identify the group from which an observation 
came. ‘The’ observable process is again stationary; that 
is to say the transition probabilities do not change with 
time: but it is not in general a Markoff chain. Hence the 
transition probabilities depend on the past history of the 
process. 

A given transition matrix, together with the function 
which partitions the states into groups, uniquely determines 
the distribution of the observable process. For any given 
partition function, there may be more than one transition 
matrix which gives rise to the observable process. Hence 
even if the entire distribution of the observable process is 
known, the matrix of transition probabilities for the original 
process cannot be determined. The general problem studied 
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in this paper is to specify the characteristics of the observable 
process which make it possible to identify the class of 
Markoff chains (transition matrices) which could give rise 
to it. A characterisation of the number of identifiable 
parameters is obtained under a specified regularity condition. 
Some further conjectures are also noted. 


(D. G. Chapman) 
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HAIGHT, F. A. (University of California, Los Angeles) 
Queuing with reneging* —Jn English 
Metrika (1959) 2, 186-197 (10 refs.) 


This article deals with.a generalisation of the usual queuing 
theory, attempting to take into consideration the rational 
behaviour of an individual queuing up. : 

The author starts with the assumption of a queue 
having a Poisson input, and an exponential service time 
distribution. An individual arriving at this queue can 
behave in the following ways: (a) baulk, i.e. not join, 
(b) join and subsequently renege, i.e. go away or desert, 
and (c) join and remain for service. 
the generalisation in contradistinction to the models used 
up to now. Baulking then is the special case of deserting on 
arrival. 

Three aims can be distinguished: 


(i) To give a decision scheme for reneging, by means 
of which an individual can behave rationally in 
uncertainty; 

(ii) To investigate the expected result of such a scheme 
of decision for the individual, and 

(iii) ‘To describe the behaviour of a queue in which such 
schemes are operative among all queuers. 


The author stresses the fact that in his paper these problems 
have not reached a final solution: he also suggests that it 
might be possible to arrive at a better formulation. 

All the following models contain W, the maximum 
tolerable waiting time. Finding a decision scheme which 
reflects the rational behaviour in the queue, is reduced to 
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HATORI, H. (Department of Mathematics, Tokyo Inst. of Tech.) 


Actually (6) represents’ 


10,4 (—-) 
(* deserting or defecting) 


the definition of a function 4(t) (t = time) which should 
be “as good as possible”. ¢(#) is to be selected from a 
class of functions with the following properties: non- 
decreasing, $(0)<0, ¢(00) = o, and the queuer will desert 
when n(t) and ¢(t) intersect; n(t) is a step function indicating 
the number of departures from the queue as a function of 
time. In this article, only a special class of functions ¢ is 
considered, namely a one-parameter family of hyperbolas. 
Choosing the hyperbolas out of the class of functions, 4, 
is not based on a criterion of optimalisation, but on additional 
assumptions concerning the individual’s behaviour. 

Having selected a certain function ¢(t) and, furthermore, 
having determined the value of the time used for waiting 
and service as well as the value the individual attributes 
to being served in his utility scale, we can find an utility 
function U—thus entering the sphere of the decision 
problem itself. The practical usefulness, however, is limited 
by the fact that the connection between U and any particular 
¢(t) is not easy to obtain. 

In the last section the state probabilities for a queue 
with baulking and deserting are computed. The main 
difference characterising this approach compared to the 
usual type, is the use of an additional function, I(x), having 
the property that the probability of one deserter in At is 
II(x).At when the queue contains x members. Thus the 
author succeeds in finding explicit solutions for the asymp- 
totic state probabilities. 


(F. Ferschl) 


10.5 (—.-) 


A note on the entropy of a continuous distribution—In English 


Kodai (Tokyo Inst. Tech.) (1958) 10, 172-176 (4 refs.) 


Following the definition of the entropy of a continuous 
distribution (after Shannon), Goldman has given an explana- 
tion on the basis of the discrete case and from some postulates 
Reich has derived an expression for the information rate. 
In this paper the author gives another set of postulates 
(which are plausible) from the discussions for the discrete 
case by Shannon and Khinchine. It is shown that the 
“entropy H(f) of an n-dimensional continuous distribution 
with probability density function f(«;, ..., ¥») which satisfies 
the set of postulates must be expressed by 


Hf) = NSF Cer, -o- #0) log f Cer, -.- 


where x is a positive constant. 

The postulates are: 

(i) The entropy H(f) of a continuous distribution 

with density function f is determined by f alone. 

(ii) The entropy H(f ) of the distribution with density f 
is smaller than the entropy H(g,) of the distribution 
with density g, which is uniform on a subset S of 
the 2 dimensional Euclidian space, if S contains the 
carrier of f and f#g, (almost everywhere). 

Let 4(x,, ...%,) be the probability density function 
of the first k random variables among the x variables 
and xy, ...Xx (X41) ---) Xn) the conditional probability 
- density function of the remaining n—k variables 


BR) Oren aX 


(iii) 


496 


under the condition that the first k variables have 
respectively taken the values x,, ..., x,. Then 


A(f) = H(4)+J.. JA ¢d(ar, -. AX. 
If the number of different values of f is finite, 
H(f) depends only on these values of f and the 
Lebesgue measures of the sets of those points at 
which f is a constant. H(f) does not depend on 
the dimension number n. 


“9 xp)Axy, le 


(iv) 


(I. Higuti) 
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JOSHI, D. D. (Panjab University, Chandigarh, India) 


10.5 (4.1) 


Information in mathematical statistics and in communication theory—In French 


Publ. Inst. Statist. Paris (1959) 8, 82-159 (39 refs.) 


This review paper consists of an introduction and four 
sections (called chapters) as follows: 


(1) Fundamental theory of information theory: discrete 
case 
(ii) The problem of binary coding. 
(iii) The fundamental theorem: general case. 
(iv) Information in mathematical statistics. 


Since the original work of Shannon the theory of inform- 
ation has made rapid progress and now has many fields 
of application. In the author’s opinion two papers by 
Khintchine [Uspekhi Mat. Nauk. (1953) 8, 3 and (1956) 
11, 17] provide the best exposition of Shannon’s theory. 

In the first chapter, the present author introduces the 
two fundamental concepts of entropy and capacity and shows 
that they are bound up with different problems of coding. 
This leads to a consideration of binary coding which is 
studied in the second chapter: the original work of Hanning 
[Bell Syst. Tech. F. (1950) 29, 147] is cited together with 
two theorems due to Zaremba [¥. Lond. Math. Soc. (1952) 
27, 242]. The chapter concludes with two new inequalities 
on the problem of optimal coding. 

The third chapter envisages the possibility of generalising 
Shannon’s theory to abstract spaces. After considering 
the concept of a transmission line in measurable space, 
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KAC, M. & SLEPIAN, D. (Cornell Univ., Ithaca, N.Y. and Bell Telephone Labs., Murray Hill, N.J.) 


Large excursions of Gaussian processes—In English 
Ann. Math. Statist. (1959) 30, 1215-1228 (7 refs.) 


.The probability distribution is sought of the interval a 
Gaussian stochastic process x(t) attains above a level x = a 
on a single excursion; this interval is called the “‘ recurrence 
time”? of the level. The process has zero mean, unit 
variance, and an autocovariance p(7). Results are given 
for the limits a>0oo and a>— ©, with main attention paid 
to the former. Previous derivations required x(t) to be 
twice differentiable; here differentiability once is all that 
is necessary. 

The distribution sought is Pr{x(t’) >a, 0<t’ St | x(0) =a, 
x'(0)>0}. The specification of conditions on both x(t) and 
its derivative x’(t) at t = 0 involves certain difficulties in 
the definition of conditional probabilities, which are discussed 
at length. ‘To define a conditional probability density like 
p(é | «(0) = a), €=-x'(0), one must form the density 
p(é| A), where A is some condition of finite probability 
p(A), and one must then pass to a limit in which the condition 
A reduces to x(0) = a. The form of the conditional density 
depends on this limiting process. In the “ vertical window ”’ 
definition, A is ‘“aS<x(0)Sa+5”, and one lets 5>0, 
obtaining a normal distribution for € with zero mean and 
variance « = | p(0)|. In the “ horizontal window ”’ 
definition, A is “ x(t) = a for 0St<6”, and when 8>0 
one finds that p(é x(0) = a) = (| € |/2) exp (—€?/2«). 
Other choices of J lead to still other forms. The empirical 
definition, in which one measures = x(t) at all points 
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and having defined its capacity, a theorem is given which is 
analogous to Shannon’s fundamental theorem. 

The final chapter is devoted to an examination of the 
concept of information in mathematical statistics. Many 
authors have tried to show the unity between the concept 
of information due to Fisher and that of Shannon-Wiener. 
In particular the author refers to Schutzenberger’s paper 
[Publ. Inst. Statist. Paris (1954) 3, 3] and Barnard’s work 
on an axiomatic theory [ 7. R. Statist. Soc. B (1951) 13, 46]. 
The author re-examines the problem from a somewhat 
different point of view. Finally, two problems are studied: 
a test of hypothesis and discrimination. It is shown that 
the Shannon-Wiener theory can be used for the former 
and the “‘ distance ”’ function given by Chernoff [Ann. Math. 
Statist. (1952) 23, 493] for the latter problem. 


(Mile. Gervaise) 


& 


10.1 (2.9) 


where x(#) = a in a realisation of the process and forms the 
probability density of the results, is equivalent to the 
horizontal window definition. The horizontal window 
definition is related to Smoluchowski’s definition of the 
mean recurrence time of a stochastic process. 

The distribution of recurrence times is found in terms 
not of x(t) but of a scaled process A(é, 6) = [x(6t)—a]/@, 
where 6 is the expected length of intervals during which 
x(t)>a. For a>1, @~a—(2z/a)!/?. By the method of 
characteristic functions it is shown that when a—>oo the 
sample functions of A(t, 6) become a family of random 
parabolas. By means of the inclusion and exclusion formula 
the probability density Q(7) for the recurrence time T of 
A(t, 6), given A(O, 0) = 0, A’(O, 6)>0, is derived. When 
the two conditions at t = 0 are imposed in the horizontal 
window sense, O(T) = (77/2) exp (—7T?/4), and when 
they are imposed in the vertical window sense, Q(T) = 
exp (—77T?/4), T>0; O(T)=0, T<0O. The probability 
density for a>—o is Q(T) = 2/exp(2T) under both 
horizontal window and vertical window conditions, where 
now @~ 27a-1/2exp (a?/2). An appendix provides formulas 
for means and covariances of general multivariate normal 
conditional probability densities. 


(C. W. Helstrom) 
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KALLIANPUR, G. (Michigan State University) 


10.2 (6.1) 


A problem in optimum filtering with finite data—In English 


Ann. Math. Statist. (1959) 30, 659-669 (3 refs.) 


The first problem treated is to find a filter to estimate the 
value at time T'+T7, of a signal x(t) = Vg(t) on the basis 
of observations of the sum 2(t) of signal and noise over the 
finite interval 0<t<T. The estimate sought is the one 
that minimises the expected square deviation. g(t) is a 
known continuous function that does not vanish everywhere 
in 0StST, and its amplitude V is a random variable with 
zero mean and a known probability density h(V). The 
“noise ’’ is Gaussian with a covariance function R(t, s) 
that is continuous in the square {0 St, s<T}. It is assumed 
T 


that the integral equation { R(t, s)p(s)ds = g(t) has a square 
J 0 


integrable solution. The best estimate of x(T+7)) is its 
conditional expected value, given 2(t) over OStST; this 
estimate is derived by using an expansion of 2(t) in eigen- 
functions of the covariance R(t, s). It turns out to be just 
g(T-+T,) times the conditional expectation of the amplitude 
V, and this depends on the observations 2(t) only through 


2(t)p(t)dt. 


0 
The method ‘is then extended to estimate x(T+T;) 
when x(t) is the sum of a finite number of signals V’,g,(t) 
with known shapes g,(t) and amplitudes V, described by a 
known joint probability density function, under the proviso 


the statistic 
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LOMNICKI, Z. A. & ZAREMBA, S. K. (Boulton Paul Aircraft Ltd., Wolverhampton) 


that the g,(t) are linearly independent. Again one substitutes 
into x(T+T,) the best estimates of the amplitudes V;. 
The case where x(t) is an arbitrary second-order continuous 
stochastic process with known covariance M(t, s) is treated 
as a limiting case of this problem. The functions g,(t) are 
replaced by the eigen-functions of the covariance M(t, s), 
and one estimates the coefficients of the expansion of x(t) 
in a series of those eigen-functions. The result in this 
case is necessarily left in the form of an infinite sum. The 
expected square-error is calculated for the case where 
x(t) is the sum of a finite number of signals. The author 
discusses the relation of his treatment to an earlier approach 
to the problem given by Laning and Battin, emphasising 
by an example the necessity of careful handling of the limit 
processes involved. The connection with the problem of 
estimating the coefficients of a linear regression is pointed 
out, 


(C. W. Helstrom) 


10.6 (-.-) 


Bandwidth and resolvability in statistical spectral analysis—In Engiish 


J. R. Statist. Soc. B (1959) 21, 169-171 (7 refs.) 


The purpose of this paper is to refute the latest objections 
‘ of Grenander to the author’s work, and to clarify the distinc- 
‘tion between resolvability, bandwidth, and bias. In their 
original paper [¥. R. Statist. Soc. B (1957) 19, 13-37; for 
discussion, see pp. 47-63], the authors referred to Grenander’s 
“uncertainty principle”? [Ark. Mat. (1951) 1, 503-531], 
namely the theorem that the product A,A, has a positive 
lower bound, where Ae is a second moment of the weighting 
function used to average the periodogram, and Ia is the 
limit of the product of the sample size and the variance of 
the estimator. ‘They criticised Grenander’s interpretation 
of A, as a measure of “lack of resolvability”’, and of A, 
as being ‘‘ asymptotically proportional”? to the standard 
_ deviation of the estimator. The authors interpreted 
“‘ resolvability ”’ as being the ability to estimate differences 
in spectral densities at neighbouring frequencies; and since 
(according to their criterion) the derivative of the “ best 2d 
estimator of the density is the ‘“‘ best’ estimator of the 
derivative, they concluded that resolvability and statistical 
reliability are not antagonistic. 

Grenander contributed briefly to the discussion on the 
above paper, and later expanded his remarks—which were 
written before the authors’ reply to the discussion was 
published—[¥. R. Statist. Soc. B (1958) 20, 152-157]. He 
rejected the authors’ heuristic argument that the uncertainty 
principle as stated would lead to the conclusion that the 
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periodogram itself is a good estimator, since the conditions 
of the theorem are not satisfied by the Dirac weight function 
which is then required. He pointed out that the “ principle ” 
could be proved in several forms, and reaffirmed that 
“resolvability (bandwidth, bias) and statistical realiability 
are antagonistic’. Finally, the authors’ results applied to 
estimators which could only be constructed when the 
spectrum was known, which does not correspond to any 
practical problem. . 

In the present paper, the authors again distinguish the 
concepts of resolvability, bandwidth and bias. "They had 
never disputed the fact that bandwidth and variance oppose 
one another and had introduced the Dirac function only 
as a limiting case. Grenander now seemed tobe using 
“bandwidth ”’ in a sense different from that of his 1951 
paper; his final point was not relevant, and was the more 
surprising since he had himself [with Rosenblatt, Statistical 
Analysis of Stationary Time Series (1957), Wiley] proposed 
to optimise estimators by a method depending on the true 
spectral density. 

The authors point out an error in Grenander and 
Rosenblatt’s derivation of the bias of the truncated periodo- 
gram. 


(C. L. Mallows) 
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KRAMER, H. P. (Bell Telephone Labs., Murray Hill, N.J.) 


Symmetrisable Markoff matrices—In English 
Ann. Math. Statist. (1959) 30, 149-153 (2 refs.) 


This paper deals with the equilibrium or stationary prob- 
abilities of Markoff processes in continuous time having 
finitely many state values. The state probabilities of the 
process, {p,(t)}, satisfy the differential equations 


N 
dpjdt = & Ov), i=1,..., N. 
j=l 


The coefficients {Q,;} are independent of time, and are the 
transition rates between states; they satisfy the condition 
N 
PL OFT, —— 0. 
The equilibrium probabilities of the process are 
7; = lim p,(t); if they exist they satisfy 2 pe Ours = (0), 
to 


with flim = 1. Equilibrium probabilities find much use 
in applications; for examples see Feller, An Introduction 
to Probability Theory, Chapter 17. The author observes that, 
in general, the equilibrium probabilities cannot be explicitly 
represented as simple functions of the Q,,’s. 

Exceptions are the birth-and-death processes; in these 
processes transitions occur only from 7 to 7+1 (with rate 
\;) or from 7 to i—1 (with rate p,), and 
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KRAMER, H. P. (Bell Telephone Labs., Murray Hill, N.Y.) 


Symmetrisable Markoff matrices—In English 
Ann. Math. Statist. (1959) 30, 149-153 (2 refs.) 


“loop condition ”’ holds: the product of matrix elements 
around a closed loop is independent of the direction of loop 
traversal. Furthermore, if the above 1,(~) symmetric 
matrix has graph consisting of a single connected set, then 
there is a unique sequence p (with respect to which the 
matrix is 1,() symmetric) such that Z}_,y;1=1, w,>0. 
The equilibrium probabilities are given by 7; = p;— 1 and 
are shown to be given by a quotient of products of matrix 
elements very similar to that for the birth and death processes. 

Rate of approach to the equilibrium probabilities is 
discussed for 1,(j4) symmetric matrices. All characteristic 
values of such matrices are non-positive, zero being the 
largest value. It is shown that the difference between 
p(t) and 7; is maximised by a function of t and the initial 
probabilities {p,(0)}. An example is given. 


(D. P. Gaver, Jr.) 
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10.1 (10.0) 


This paper discusses a class of Markoff matrices Q = [Q,;] 
which includes the birth-and-death matrices and allows a 
representation similar to that immediately above for the 
birth-and-death probabilities. This is the class of 1,()- . 
symmetric matrices with connected graph. The required 
definitions follow. Q is an N by N matrix. The graph 
G(Q) associated with Q consists of vertices V,, ...Vxy and 
of directed line segments joining certain of them. The 
directed line segment from V,; to V;, V,V;, belongs to 
G(Q) if and only if Q;;40. Denote by [k,kp...k,] the 
product of matrix elements Qkyko: Qkoks---Ok, she Ve is 
connected to V; if there exist indices k,, ks, ...k; such that 
[jR,...k@]40. A graph G is connected if (a) in case 
VV;eEG, then V;V;EG, and (6) every pair of vertices is 
connected by at least one chain of lines. Let » be a sequence 
of N positive numbers; 1,() is an inner product space 
of sequences of N complex numbers, with inner product 


N 
(x,y) = 2 edit 
i 


Every N by N matrix is an operator on 1,(u). A matrix A 
is 1,(~) symmetric if (Ax, y) = (x, Ay) for every x and y 
in 1,(4). It is shown that a matrix with non-negative off- 
diagonal elements is 1,() symmetric if and only if the 


10.1 (10.0) 
continued 
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(Australian National University, Canberra) 


10.9 (10.1) 


The survival of a mutant gene under selection—Jn English 


J. Aust. Math. Soc. (1959) 1, 121-126 (7 refs.) 


The author discusses the problem of survival of a single 
mutant in a finite genetic population of 2 haploid individuals 
in which generations overlap, the mutant gene having a 
selective advantage. 

Selection is first assumed to act through the varying of 
lifetime distributions, the mutant gene a having a negative 
exponential distribution with parameter Ay 1 while the non- 
mutant gene A has a similar distribution with different 
parameter Ay ‘]; individuals born as replacements are chosen 
at random from the gametic output of the previous generation 
so that the probability of their being a or A is k(2n)— or 
(2n—k)(2n)*. The state of the population is then given 
by the number k(= 0, 1, 2, ..., 2”) of genes a at those 
discrete times at which a birth-death event occurs, and the 
Markoff chain embedded in the continuous process is 
defined by the transition probabilities 


Pr, ra = Agk(2n —k)(2n) {gk +4 (2n —k)}, 
Pr, roi = AyR(2n—R)(2n)'{Agk +A, (2n—k)}, 
Pee = 1—DPi, 21 —Pk, e412 


If, however, selection is introduced by the relative 
reproductive powers of a and A, these being in the ratio 
[y/{t2 so that the probability of birth of a gene a or A is 
PaR{ pak + po(2n—k)}™ or po(2n—k){pyk + p2(2n —k)} > while 
the lifetime distributions are the same negative exponential, 
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NASR, S. K. (University of Alexandria) 
On some problems of machine interference—In English 
J. R. Statist. Soc. B (1959) 21, 106-113 (10 refs.) 


After a brief discussion of the work by Khinchine, Ashcroft, 
’ Palm, Naor, Benson, and Cox, the author considers Palm’s 
solution as quoted by Feller [Introduction to Probability 
Theory 1 (1957), New York, Wiley]. For the case when 
m machines are attended by 7 operatives, an expansion is 
given for the servicing factor which is shown to be consistent. 
The expression for machine efficiency given by Naor in 
Poisson functions is then converted to binomial and Poisson 
cumulative functions: the author remarks that this may 
ease computation. An expression is given for the probability 
that there are some machines waiting to be serviced. 

The author goes on to discuss the numerical results 
found in the literature and states that they may not seem 
to be compatible; an example is given. He then finds a 
limiting value of the machine efficiency as the number of 
machines increases indefinitely, while the number of machines 
attended by an operative remains fixed. ie 

Finally, the effect on a loss function of assigning ancillary 
work to operatives is discussed. The loss function 1s used 
to obtain lower and upper bounds for the number of operatives 
required. : 


(aL McGloughlin) 
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the embedded Markoff chain of the process has transition 
probabilities also of the form shown above, with 4, pL 
replacing 4,, A, respectively. 

Since the ratio px, 441/Px, x1 in both models is constant, 
being ;/A, and p/p. respectively, the problems are seen 
to be identical. On embedding a secondary chain in the 
previous ones by considering only those jumps for which k 
changes its value, so that pz, x1 = 1—7, Per = 0, Dr, na = 
a, Where 7 = A,/(Ay+A,) or py/(u1+ 2), the models reduce 
to the classical gambler’s ruin problem. 

Writing 7 = (1+«)(2+«)+ where « is the selective 
advantage, the probability P,, of survival of the gene a 
is thus seen to be 


Pay = (1-+a)*"-# —(1 +a) ?"/1 —(1 +0) 


ky being the initial number of genes a. For « = O(n7) 
and n large, P.,~ak,)(1 —e-?*”)+4, a result similar to Fisher’s 
[The Genetical Theory of Natural Selection (1930) Oxford] 
approximation P,, = 2a(1 —e-*¢”)— for ky = 1 when gener- 
ations do not overlap. 

A comparison is made with previous results for associated 
models, and the need for a solution to the parallel diploid 
population model stressed. 


(J. Gani) 
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ONICESCU, O. (Bucharest University) 


10.0 (1.0) 


On processes of a field of events and on the corresponding random function—In French 
Bull. Math. Soc. Sci. Math. Phys. R.P.R. (1958) 2, 95-108. 


The aim of this paper is the construction of a stochastic 
process by the synthesis of its two basic components: a 
somatic process, and a measure. The somatic process is 
a non-stochastic process of a field of events depending 
upon time in a determined manner. ‘To this set-theoretical 
aspect of the somatic basis of the process, a numerical 
aspect is associated considering certain numerical functions 
determined under suitable conditions. The measure, 
defined on the events of that field, is also varying in time, 
yielding the probabilities of the process. 

For a finite field K, the events of the process are formed 
by associating every event A of K to a time instant t, starting 
from an initial moment. The elementary events of the 
process are constituted by temporally ordered sets of the 
elementary events, corresponding (according to a determined 
law) to every time instant t. The fundamental events of 
the process are formed by associating an event A of the 
field to a determined value of t, while the total event is 
associated to each other event. Events, formed by associating 
an event A to a time instant t, while an elementary event 
is associated to the initial moment, are also characterised. 
A set-theoretical Chapman composition law is outlined. 
By making use of this composition law, a system of prob- 
abilities is associated to the field yielding in this way a 


505 


ONOYAMA, T. (Kyushu University, Japan) 
Note on random distributions—In English 


simple Markoff chain. The set functions defined on the 
non-probabilised event-field verify a non-linear functional 
equation recalling the equation of branching chains. 

In the second part of the paper, starting from an event- 
field K defined on a basic space, a somatic process in time 
is constructed, by a method generalising the previous one. 
This becomes possible by making use of an integral operation 
concerning set-functions with values in a set-field. In this 
manner we are led to an event-composition law and similarly 
to the composition law of a system of probabilities. 


(R. Theodorescu) 


; 10.6 (10.0) 


Mem. Fac. Sci. A, Kyushu (1959) 13, 208-213 (5 refs.) 


The random distribution introduced by Ito [*‘ Stationary 
random distributions,” Mem. Coll. Sci., Kyoto, A (1954) 
28] and Gelfand [‘‘ Generalised random processes,”’ Dokl. 
Acad. Nauk. SSSR (1955) 103] is an H (Hilbert) space- 
valued continuous linear functional defined on the space 
of basic functions D; where D is the set of all complex- 
valued C-functions defined over R(—©, ©) and with 
io.6) 


compact carriers. 

The stationary stochastic process finds its natural 
extension in the random distribution and the spectral 
representation of the stationary process has its analogue for 
the stationary random distribution. This spectral represent- 
ation of the stationary random distribution is validated 
through the Fourier transforms of the basic functions. 

Thus the extension of the space of basic functions will 
enable us to extend the definition of the stationary distribu- 
tion. Inthe paper by Gelfand & Silov [“‘ Fourier transforms 
of rapidly increasing functions and questions of uniqueness 
of the solution of Cauchy’s problem,” Uspekhi Math. Nauk. 
(1953) 8] a foundation for the Fourier transforms of rapidly 
increasing functions was given. In the present paper the 
author defines the random distribution on the space K,(p> 1), 
the space of functions ¢(x) which satisfy the inequality 
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| (x) | SC, exp (—C | x |”) for some constants C, and C 
(depending on the function) together with each of their 
derivatives. The notion of the stationary distribution is 
then extended by using the results of Gelfand and Silov. 
The possibility is suggested of applying the results to the 
non-homogeneous stochastic linear translatable equations 
discussed by the same author [“‘ Linear translatable stochastic 
functional equations and random distributions,” Mem. Fac. 
Sci. A, Kyushu (1956) 10). 


(H. Akaike) 
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PRAPORGESCU, N. 
On forming chain probabilities—In Rumanian 
Studi si cercetdri mat. (1958) 9, 439-480 (7 refs.) 


This paper deals with the asymptotic behaviour Genteriine 
stationary and non-stationary Markoff chains and chains 
with complete connections (notion introduced by Onicescu 
and Mihoc). This is done by making use of certain special 
systems of integral equations studied by the author in a 
previous paper [Bul. stiintific al Acad. R.P.P., Sectiunea de 
stunte matematice si fizice (1956) 8, 549-616] as well as of 
the characteristic function method. 

For the simple stationary chains, the general form of 
the transition probabilities is deduced. By making use of 
certain properties of the roots of the corresponding char- 
acteristic determinant the asymptotic values of the transition 
probabilities are deduced. In this way results given 
previously by Fréchet were rediscovered. Moreover, these 
results hold good for continuous and multiple chains. 
Furthermore, under suitable conditions the non-stationary 
chains have a similar asymptotic behaviour as in the station- 
ary case. Analogous results are also deduced for chains 
with complete connections. The last part deals with slowly 
convergent distribution functions ; it is shown that the 
expression of the corresponding asymptotic characteristic 
. function generalises to the Poisson distribution function. 

As starting point, the author took the monograph of 
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PRAPORGESCU, N. (Inst. Math., People’s Republic Academy, Bucharest) 
On temporally variable probabilities forming a chain—In Russian 


Rev. Math. Pures Appl. (1959) 4, 403-423 (7 refs.) 


This paper is the sequel of a previous one published by 
the author in Studzi st cercetdri Matematice (1958) 9, 439-480. 
Non-stationary simple Markoff chains with discrete as well 
as with continuous set of states are investigated. By means 
of similar methods as used in the quoted paper, expressions 
for the solutions of the corresponding Chapman-Kolmogoroff 
equations are given. These expressions are used in order 
to characterise the asymptotic behaviour. It is stated by 
the author that the results hold true for multiple chains. 


(R. Theodorescu) 
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(Institute of Mathematics, Bucharest) 


10.1 (1.0) 


Onicescu and Mihoc ‘‘ Les chaines de variables aléatoires. 


Problémes asymptotiques,” Etudes et recherches de Académie 
Roumaine (1943) 14. 


(G. Simboan) 
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RENYI, A. (Eétvés University, Budapest) 10.1 (2.5) 


On serial and parallel coupling of autoclaves and on the theory of mixing—In Hungarian 
Publ. Math. Inst. Hung. Acad. Sci. (1959) 4, 155-165 (1 ref.) 


The problem discussed in this paper arises from a situation 
where a material is mixed in an autoclave. The mixing is 
called ideal if the distribution of the time which a particle 
spends after some moment ¢ in the autoclave does not 
depend on the length of the time which the particle has 
already spent there. This condition means that the dis- 
tribution of the length of time which a particle spends in 
the autoclave is an exponential distribution. The author 
deals with the following problem: how to switch a given 
number of autoclaves if we want to maximise the percentage 
of the particles which remain more than ¢t) hours in the 
system of autoclaves and if we want a given amount of 
material to flow through the autoclaves during an hour. 

In the second part of the paper the problem of non- 
ideal mixing is discussed. The ratio i = D*(¢)/M%Eé) is 
called by the author the “‘ intensity of mixing”’: £ is equal 
to the time which a particle spends in the autoclave, M(é) 
is the mean value of D?(é) the variance of the random 
variable €. In the case of ideal mixing 7 = 1 and if there is 
no mixing at all, then z= 0. If the probability that a 
particle leaves the autoclave after spending a certain amount 
of time («) there is a monotonically non-decreasing function 
of x, then the mixing is called effective. It is shown that 
in case of effective mixing log 1/{1—P(é<x)} is a convex 
function, which implies 7X1. 


(P. Révész) 
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REVESZ, P. (Eédtvés University, Budapest) 10.0 (1.5) 
A generalisation of the zero-one law—IJn English 
Ann. Univ. Sci. Budapest. Sect. Math. (1959) 2, 111-113 (4 refs.) 


The zero-one law and the Borel-Cantelli lemma are well- 
known in the theory of probability. The validity of these 
theorems is usually proved only for mutually independent 
random variables or—in the case of the Borel-Cantelli 
lemma—for mutually independent events. In this paper 
the following interesting question is posed: is the condition 
of independence necessary or not in these theorems? Erdés 
& Rényi in their paper ‘‘ On Cantor’s series with convergent 
21,,” [Ann. Univ. Sci. Budapest. Sect. Math. (1959) 2, 
93-109] proved that the assertion of the Borel-Cantelli 
lemma remains true if we suppose only that the events are 
pairwise independent. The author deals with the corres- 
ponding problem in the case of the zero-one law. ; It is 
shown by a simple counter-example that the assertion of 
the zero-one law is not true generally for pairwise independent 
random variables. However, the following theorem is valid: 
Let {Q, FY, P} be a probability field and let &, &, ... be 
an ergodic Markoff-chain. Let SY, denote the smallest 
Borel-field with respect to which €,, é,41, -.. are measurable 
and put 77, =S*. Thus, if deS*, it follows that 
P(A) = 0 or P(A) = 1 and if 7 is a random variable 
measurable on the sample space “*, it follows that there is 
a constant c such that A(n = c) = 1. 


(J. Mogyorédi) 
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RUBIN, H. & TUCKER, H. G. (Oregon Univ., Portland and Univ. California, Riverside) 


10.2 (10.1) 


Estimating the parameters of a differential process—In English 


Ann. Math. Statist. (1959) 30, 641-658 (2 refs.) 


Let X denote a differential process, that is to say a stochastic 
process such that the increments of the process in any time 
interval have a distribution that depends only on the length 
of the interval, and not otherwise on time or on changes 
in any other time interval. The Poisson process is a simple 
example. It is known that the probability distribution of 
such a differential process is a convolution of a normal 
distribution and a possible infinite number of Poisson laws. 
The distribution of X(¢) is infinitely divisible and hence the 
logarithm of the characteristic function can be written as 


\ +0 
log f(u) = iytu+t | {el —1 —iux/(1 +2)}(1 +x?) |x?dG(x). 


Here y is a trend term while the saltus of G at 0, i.e. 
G(+0)—G(—0), is the variance of the normal component. 
Furthermore, the jumps of G (which is non-decreasing) 
give information as to the Poisson components. 

This paper gives two estimates of this characterising 
function G:: the first of these require the observation of the 
sample function at any finite number of time points. The 
second requires in addition that it also be possible to observe 
every jump of the sample function over a finite interval. 
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, | 
SAKINO, S. & HAYASHI, C. (Institute Statistics Mathematics, Tokyo) 


In this case it is also possible to estimate the variance of 
the normal component. The first estimate may be biased 
while the second estimate is unbiased over all intervals 
which do not contain zero. 

Finally an unbiased estimate is given for the trend 
term y: this is based upon complete observation of the 
process for the interval [0, 1]. 


(D. G. Chapman) 


10.9 (11.7) 


On the analysis of epidemic model I (Theoretical approach)—In English 
Ann. Inst. Statist. Math., Tokyo (1959) 10, 261-275 (7 refs., 6 figs.) 


This paper considers the construction of epidemic models 
in connection with communities when the assumption of 
homogenous mixing is satisfied. The chance of contact 
is assumed to be the same for any pair of one infective and 
one susceptible member but not so the chance of attack 
for all contact members. A further simplifying assumption 
is that the latent period be reduced to zero by comparison 
with the infection period. 

In section 2 and 3 of the paper a stochastic epidemic 
model is constructed and examined numerically by Monte 
Carlo methods. A diagram is given summarising the relation- 
ship between the three elements of the population at risk: 


‘susceptibles, infectives, and removals, and probabilities 


connecting these elements in the time interval (¢, t+A?). 

In section 4 of the paper the stochastic model is enlarged 
by introducing the concept of the conditional probability 
by which a person infected at time 7 will recover in the 
time interval (t, t+At). This overcomes the disadvantage 
of the model outlined in section 2 where the recovery of 
an infected person is independent of the infected time. 

The paper concludes with a reconsideration of some 
points in the classical (deterministic) treatment of these 
problems. A further paper is envisaged which will examine 
the validity of the stochastic models developed in this paper 
using data from influenza epidemics. 


(M. Siotani) 
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SCHMETTERER, L. (Inst. fiir Versich.-math. und Math. Statist., Hamburg) 


10.5 (5.0 
Report on the literature on information theory—In German - 


— LEE 


Bld. Dtsch. Ges. Versich.-math. (1960) 4, 259-266 (76 refs.) 


The object of this report with its extensive bibliography is 
to provide a survey of the various developments that can 
be traced in information theory. Such a survey is also 
important on account of the fact that information theory 
plays a role in the theory of communication as well as in 
mathematical statistics, and the designations are not always 
used in the same way. A certain amount of mathematical 
knowledge is, however, required in order to read the report. 


After a brief discussion of the fundamental contributions | 


of Shannon and the enumeration of several works dealing 
with the technological aspects of the problem, the report 
is concerned in detail with the mathematical difficulties of 
information theory. Some of Shannon’s concepts had to 
be formulated more precisely or even modified, in order to 
make it possible to give a correct proof of his theorems. 
The extension of information theory to stochastic processes 
(Gelfund, Jaglom) and various other generalisations are 
also discussed. 

Finally the application of information theory in mathe- 
matical statistics is treated. 


(K. Prachar) 


RY I) 


STEUTEL, F. W. (Mathematical Centre, Amsterdam) 


10.4 (2.5) 


Reduction of the mean waiting time by priority assignment-——In Dutch 


Statist. Neerlandica (1959) 13, 503-513 


This paper deals with the reduction of the mean waiting 
time by priority assignment in a one-counter waiting time 
process. 

In the first part, the situation is considered in which 
customers of two kinds arrive in independent Poisson 
streams, the service time of all customers of the same kind 
having the same distribution. Priority is given to customers 
of one kind. The mean waiting time averaged over all 
customers is obtained with use of Cobham’s formulae [ 7. 
Operat. Res. (1954) 2, 70-76] and compared with the 
mean waiting time in the case where no priorities are given. 
It is proved that, in order to reduce the mean waiting time, 
priority should be given to the customers having the smaller 
mean service time. 

_ In the second part of this paper a method is used which 
was introduced by Prins for the special case of exponentially 
distributed service times: customers arrive in a Poisson 
stream, the service times of all customers having the same 
distribution with mean uw. 
service time of an arriving customer can be estimated with 
sufficient accuracy the customers are divided into those 
having service times cy and those with service times 
>cy, c being a constant. In this way a situation is created 
identical to the one described in the first part of this paper. 
Priority is given to the customers with the smaller service 


514 


On the assumption that the’ 


times. The averaged mean waiting time is calculated and 
(on the assumption of differentiability of the service time 
distribution) minimised with respect to c. 

Some examples and graphs are added. 


(F. W. Steutel) 


STOLLER, D. S. & STOLLER, L. C. (Rand Corp. and Space Tech. Labs. 
Calculating the coefficients of certain linear predictors—In English 


: Los Angeles) 10.2 (11.1) 


Math. Tab., Wash. (1959) 13, 122-124 and 129 (3 refs.) 


This paper presents a method of obtaining the coefficients, 
in a form suitable for computation, for a problem in pre- 
diction. It is assumed that observations (Cai, CH cay BAD 
have been made at +1 equally spaced points. It is desired 
to find an estimator of the next observation, yn,1, under 
the following assumptions: 


(i) the expected value of the ith observation is a poly- 
nomial of degree k. 


(i1) the expected value of a linear predictor 


nv 

(Yn41 = 2 a,j) 

: SO 
is equal to the expected value of the next observation 
for an arbitrary polynomial of degree k. 

(iii) the observations are assumed to be independent 
and equally variable. 


The expected value of the ith observation is expressed 
as a linear function of orthogonal polynomials. The co- 
efficients of the linear predictor (a;) are determined so as 
to minimise the variance of the predicted value, nj. 
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STREIBEL, C. T. (Lockheed aperstt Corp., Palo Alto, California) 


Densities for stochastic processes—In English 
Ann. Math. Statist. (1959) 30, 559-567 (7 refs.) 


Let x@(t), 0 € 2 be a parametric family of stochastic processes 
defined by their finite dimensional distributions: that is 
Fola(t,), -.-» x(tn)]; 9 € 2 is a given n-dimensional distribu- 
tion for every finite set {(t;, fo, ..., t)}- 

One procedure for solving the statistical problems con- 
cerning @ (for example estimation of @) is to solve the problem 
for the finite dimensional family. If the solution so found 
has a limit in some sense for some suitably chosen limiting 
operation of the sets {t,, tz, ..., tn} and is independent of 
the defining set, then the statistical problem is solved for 
the stochastic process. 

An alternative procedure was proposed by Grenander 
LArk. Mat. (1950), 1, 195-277]. He introduces the likelihood 
ratio of two processes P and Q restricted to a finite number 
of time points, shows that it converges to a limit as the 
number of points goes to infinity and this limit is the density 
of P with respect to Q if it exists. If this density exists, 
and if Q can be suitably chosen, standard statistical tech- 
niques can be applied. 

The present paper gives a criterion for the existence 
of a density which proves convenient in several applications. 
A condition is also given in which a density computed for 
a countable number of time points is valid for the continuous 
parameter process. 
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Values of the a; have been computed for k = 1(1)10 and 
for various values of m. They are available from the 
Unpublished Mathematical Tables Repository of MTAC. 
The sum of the coefficients should equal unity. This is 
checked to six significant figures for degrees 1 through to 8 
and to five significant figures for degrees 9 and 10. 


(D. Teichroew) 


10.2 (10.3) 


The results are applied to a normal process with known 
continuous covariance function and unknown mean of the 


& 
form 2 k,¢,(t), ¢; known, k; unknown. The process is 
4=1 


absolutely continuous with respect to the process with 
mean zero and the same covariance. Thus a relative density 
of the general process is derived and minimum variance 
unbiased estimates found for the k;. The same method is 
used to derive the maximum likelihood estimate of the 
correlation parameter of the Ornstein-Uhlenbeck process. 


(D. G. Chapman) 


TAGA, Y. & ISII, K. (Institute of Statistical Mathematics, Tokyo) 


10.5 (10.2) 


On a stochastic model concerning the pattern of communication—Diffusion of news 


in a social group—In English 


Ann. Inst. Statist. Math., Tokyo (1959, 11, 25-43 (6 refs., 3 figs.) 


A stochastic model is treated in which certain information J 
is communicated from the information source to persons, 
or from a person to another in a system consisting of a 
social group 7 and an information source SS. 

The social group 7 consists of M persons, and the 
information source S has as a a function the constant trans- 
mitting of certain information J to persons in 7 during 
the period of observation. It is assumed that none of the 
persons in 7 have the information J at time t = 0, and J, 
starting from S, is communicated from one person to 
another in the course of time. In the time interval (¢, t+ Az) 
for sufficiently small At, the transmission of J from a person 
to another has the probability A . At+-o(Az), while the trans- 
mission from S to a person has the probability » . At+o(A2), 
and the probability of two or more transmissions is o(At). 

It is assumed, moreover, that all possible communications 
in the group 7 are made mutually independently, and that 
the events which occur in (t, t+At) are independent of the 
past history of the system up to time ¢. In order to char- 
acterise the diagrammatic pattern of the tree produced in 
the system up to time t, the authors introduce the random 
variables N(t), K(t) and L(t) which represent respectively 
the number of persons having received I up to time ft, the 
number of persons who have received J but not yet com- 
municated it to any other person up to time t, and the 
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URBANIK, K. (Math. Inst., Polish Academy of Sciences, 
An effective example of a Gaussian function—In Polish 
Bull. Acad. Polon. Sci. 111 (1959) 7, 343-349 


Let E be a Lebesgue measurable subset of the positive 
half-line. The set E is said to be relatively measurable, if 
the limit 
FN res ee rs I | Ent: 0<t<T}| 
To 


exists (by | Z | we denote the Lebesgue measure of Z). 
For every interval I = {t: aS<t<b}(0Sa<b) and for every 
real-valued function f(t) defined on the positive half-line 
we shall use the following notation: 


f (7) =f ()—f @, +t = {utt: uel}. 


We say that a continuous function f(t) defined on the 
positive half-line is a Gaussian function if for every system 
of disjoint intervals I,, In, ..., I, and for every system of 


k 
real numbers x1, %2, .... x, the set O {t:f *([; +t) <x} is 


j=2 
relatively measurable and 
:f *7,4+pn< — a 
a fi ( wt ) 3} R j=l VI Te | 
1 £ 
— ee“ du. 
where d(x) 5s { oa 
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Warsaw) 


number of persons having received J directly from S up 
to time ¢. Then it is proved that the conditional probabilities 


Pax = Pr{K(t) = k | N(t) = n}, 
Pay = Pr{L@) =1| NW) =n} 


are independent of time t and size M of the group 7. 
Further, the conditional expectations 


E,(K) = E{K() | N@ = n}, EL) = EL | N@ = 2}, 


and the conditional variances V,,(K) of K(t) and V,(L) of 
Lit), given N(t) = n, are calculated. These results make it 
possible to test the assumption of homogeneity of com- 
munication in group 7, or to estimate the parameter § = p/A 
which represents the ratio of intensities of two communica- 
tions, namely, from S to a person and from one person to 
another. 

It is also proved that normal approximation is valid 
for the conditional distributions {p,,,} of K(#), {Dna of 
L(t), and that the bi-variate statistic (N(t), L(t)) is a sufficient 
statistic for the family of distributions Pr{U(t); A, »} with 
the parameters (A, ,). 

Further, it is noticed that among various information 
sources S; the most effective one is obtained by comparing 
the estimates of the parameters 5; = p,/A corresponding 
to S;, given the same information in the social groups. 


(Y. Taga) 


10.0 (10.1) 


It is well known that almost all realisations of the normalised 
Brownian movement processes are Gaussian functions. In 
the note an effective example of a Gaussian function is 
given. 


(L. Kubik) 
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WINSTEN, C. B. (Imperial College, London) 


Geometric distributions in the theory of queues—In English 


10.4 (10.1) 


J. R. Statist. Soc. B (1959) 21, 1-22 (8 refs., 2 tables, 2 figs.) 


A new explanation in probability terms is given for the 
occurrence of geometric or conditional geometric distribu- 
tions in some queueing problems. Suppose customers 
arrive at regular intervals, and that between each pair of 
arrivals the potential number of customers served is a 
random variable which is distributed independently and 
identically for separate intervals. ‘The actual number served 
is equal to this potential number if the queue is long 
enough; otherwise the queue-length is reduced to zero 
before the next arrival. This formulation of the queueing 
situation is equivalent to specifying an infinite transition 
matrix in which, except for the first column, the elements 
in each of the diagonals are equal; also there are zeros 
everywhere above the first super-diagonal. 

Now the probability that a queue-trajectory (i.e. history 
of the queue-length for a single realisation of the process) 
which passes through the value g at time zero, passes 
through g again at time t without having fallen below q in 
between, is independent of g. Assuming equilibrium, the 
probability that any trajectory passes through g at time 
zero can be calculated as a sum of terms relating to the 
last time the trajectory passed through g—1; and this 
relation is independent of g. Hence it follows that the 
equilibrium distribution of queue-length must be geometric. 

The argument applies in other cases, for example if 
there is more than one server. ‘The transition matrix is 
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WINSTEN, C. B. (Report on a Discussion on paper by) 


as above except that now more than one initial column 
may be arbitrary. The queue-length is geometric, except 
for the initial terms. 

Next it is assumed that customers arrive late, the amounts 
of lateness being independent and identically distributed 
and being bounded by unity, so that customers cannot arrive 
in the wrong order; service-times are now exponential. 
The limiting queue-length distribution is geometric, except 
for the first term. 'The waiting-time is exponential with 
a lump at zero; the mean waiting-time is a minimum when 
the arrivals are equally spaced, and maximum when the 
lateness distribution approaches a certain binomial distribu- 
tion on zero and unity. 

The case when the lateness extends up to two time- 
units is investigated. 

Finally, if the queue-length can increase by at most k 
at each arrival-epoch, the equilibrium queue-length prob- 
abilities satisfy a Ath order difference equation. This covers 
the case of k-stage (Erlang) service. 


(C. L. Mallows) 


Editorial Note: see abstract No. 522, 10.4. 
Winsten 


10.4 (10.1) 


Geometric distributions in the theory of queues—Jn English 


J. R. Statist. Soc. B (1959) 21, 23-35. 


The contributors to the discussion were Lindley, Wishart, 
Foster, Takacs, Kiefer, Lewis, Jackson, Conolly (in writing), 
and Mallows: the author replied. 

Lindley pointed out that the usual assumption of in- 
dependent inter-arrival times could arise naturally when 
the input to the queueing system was the output from a 
previous system, and that Winsten’s method could be 
applied to this case, at least when the service-times were 
exponential. The usual technique for studying a stochastic 
process was to obtain a relation between a state-probability 
at time zero and the probabilities of all the possible states 
at some previous fixed time. Winsten’s paper demonstrated 
that other types of relation may be valuable, in terms of the 
probabilities of trajectories (of all possible lengths) through 
a fixed state. In many situations, for example in the finite dam 
problem, this change of emphasis may lead to new results. 

Wishart noted an extension of the argument to the 
system GI/E;/s, and described a Monte Carlo experiment 
relating to equidistant scheduled arrivals, exponential late- 
ness distribution, and single exponential service. 

Foster remarked that the methods of the paper applied 
also when customers could be early as well as late. For the 
system GJ/E,/1, one can imagine customers arriving in 
batches of k; with exponential service; there is an integral 
relation between the queue-length generating functions for 
this model and for the model obtained by counting only 
every kth customer. A similar approach can be used to 
solve the system E£;,/G/1. 
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Takacs presented a more general formulation of the 
queueing process, which leads naturally to a proof of the 
geometric results in the cases considered by Winsten, and 
pointed out an analogy between the M/GI/1 and the GI/M/s 
(with finite waiting-room) processes. 

Keifer mentioned the problem of simultaneous queues, 
with customers joining the shorter queue; and of customer 
impatience, where each customer leaves without being 
served if he has to wait for longer than a specified time. 

Lewis said that if the lateness distribution has a large 
variance, certain sums arising in the specification of the 
properties of the arrival process could be replaced by integrals, 
and simple results obtained. 

Jackson distinguished three classes of lateness distribution, 
namely, everybody early; everybody late; neither late nor 
early. Conolly also referred to the problem of early 
arrivals. Mallows showed that Winsten’s method applied 
also when customers may baulk. 

The author replied, and discussed the system M/M/s, 
showing the effect of customer impatience; he noted that 
a point raised by Mallows, the analogy given by Takacs, 
and some recent results on reservoirs, all follows from 
symmetries in the corresponding queue-trajectories. 


(C. L. Mallows) 


Editorial Note: see abstract No. 521, 10.4. 
Winsten 


Tet eases wow ea 


ee ee et 
is rh is : SA Nas Ate ae hoe re 6s a PER Siva oy ei Aer 
x ong DSi Sib 


Ss SN gah saat ae Peet * mes ee Pres Ree oe uykt ko 
har mot FS i aes He) ae pattie tne a Hira’ ees 
HHS . . ) tabi See actiee fade Ae a, i ed a ' : ag ee calle: Pepe 


Nt agate ey fe PS : Li , an BN 4b! an ee i. 
; NR nan ee Pac Sie teak 
. . 1S sen ella 
as n ' ‘ -4 \ a e i 
; cae ; 4 Ph et Ret ge ne Phe ie 4 Ss 


. : ~- : ; ee at ee Sb ers: on ue 


2 et Sey i ; 1a ET) oe ie Tea Deer z; 
res % Sy eo oe Bact alley Pi gp ss ees 


} gle ; ae ee LS: tia a a 


het’. 
a 4 7 7 
Ve 
P 
F 
eta gt 
i i 
es 
oie 5 
lia 
; ¥ 
* a 
‘ r 
4). a y 


WEINSTEIN, A. S. (New York State Department of Mental Hygiene) 
Alternative definitions of the serial correlation coefficient in short autoregressive 


sequences—In English 


JF. Amer. Statist. Ass. (1958) 53, 881-892 (28 refs., 4 tables) 


This review paper summarises alternative definitions of the 
serial coefficient and alternative methods of estimation are 


investigated with the aid of series of known autoregressive. 


properties. Included in the investigation are the ordinary 
circular and non-circular definitions as well as some approxi- 
mations. The bias introduced by the necessity of correcting 
for an unknown mean is considered. The series used in the 
estimation are constructed to represent a time series of 
monthly observations and estimates based upon the segment- 
ation of the series into observations for a given month 
presented. The methods employed are explained in greater 
detail in a previous paper by the same author given in the 
references. 


: 


(W. A. Fuller) 
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ZUBRZYCKI, S. (Math. Inst., Polish Academy of Sciences, Warsaw) 


Concerning Yule’s characteristic of style—In English 
Zastosowania Mat. (1959) 4, 328-331 


This note is concerned with a discussion of two character- 
istics of style used in statistical linguistics and based on 
counts of occurrences of particular words in a text. ‘The 
first of them is due to Yule [Statistical Study of Vocabulary 
(1944), Cambridge], the other to Herdan [ Zeit. angew. Math. 
Phys. (1955) 6, 332-334]. In this note, a probabilistic 
explanation is given for that these characteristics are reason- 
ably independent of the sample size. The difference between 
them is also stressed. The explanation given differs from 
those contained in Herdan’s earlier publications [see also 
Zeit. angew. Math. Mech. (1956) 36, 72 and his book 
Language as Choice and Chance (1956), Groningen]. The 
main argument is based on a relation which is also contained 
in Herdan’s later paper [¥. Appl. Math. Phys. (1958) 9, 
69-73]. 


(S. Zubrzycki) 
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BELLMAN, R. & DREYFUS, S. (RAND Corp., Santa Monica, California) 


11.5 (0.8) 


Functional approximations and dynamic programming—In English 


Math. Tab., Wash. (1959) 13, 247-251 (5 refs.) 


One of the difficulties which arise in the numerical solution 
of problems is that as the number of dimensions of the 
problem increases, the number of calculations which must 
be performed increases as a power. For example, if a 
function must be evaluated at m points in one dimension, 
it must be evaluated at m? points in two dimensions, m° in 
three dimensions, etc. The authors suggest that instead 
of computing a function value for each grid point the func- 
tions be represented by a set of orthogonal polynomials; 
for example, they might be Legendre polynomials with the 
coefficients chosen to be the Christoffel numbers. 

The following table gives a comparison of the number 
of values which must be stored if a grid of size A is used 
as compared with using orthogonal functions of degree R, 
and d dimensions, in which case (R+1)(R+2)...(R+d)/d 
coefficients would be required. Using A = 0:01 and R = 5, 
10, the number of values which must be stored is as follows: 


Dimension (d) UN) SE Ra 10) 
Oe 6 11 
2 10? 2A 66 
3 10° 56 286 
4 108 126 1001 
5 LOSS 252 3003 


It can be seen that the direct method would require 
a storage capacity completely beyond that of present 
machines whereas the method of approximating by a series 
of orthogonal functions is now practicable. 
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GREEN, B. F. Jr., SMITH, J. E. K. & KLEM, L.. (Massachusetts Inst. Tech., Lexington) 


The authors apply this technique to a problem in 
dynamic programming; the problem is to determine {v,} 
so as to minimise the function 

N N 
Z utr = vi 
k=1 k=1 
where (4) tp.4 = Quan —uezt+uny Vy = (6 
(5) v, must be chosen so as to keep unyy 
within the interval [—1, 1]., 
Let fy(c) be the value of Fy obtained when the minimum 
values of the {v,,} are used. 
fn(c) = Min [c?+Av2+fy_,(2c —c? +-v)] 
for N22? and 7,(c) = ce. 

The following table shows the agreement between the 
grid technique and the orthogonal polynomial technique 
for the case N = 6; f.(c) is determined by using the grid 
and f (0) by using the Legendre polynomials. 


Fy = 


c KO) foc) 
1:0 1-782 ilop/7/ 
0°8 1:370 1:36 
0:2 0-153 0-145 
0-0 0-006 0-0 

—(0:2 0-202 0-20 
—0°8 4-876 4-89 
—1:0 8-666 8:67 


The agreement is regarded as quite good. 


(D. Teichroew) 


11.5 (2.6) 


Empirical tests of an additive random number generator—In English 


J. Ass. Comp. Mach. (1959) 6, 527-537 (8 refs., 8 tables, 1 fig.) 


Most of the presently used procedures for generating 
random numbers on digital computers are based on the 
multiplicative process; e.g., 

X; = aX;_,(mod 2’). 


In this process one random number is obtained from the 
previous one by one multiplication. This procedure has 
been studied extensively and has been found satisfactory. 
However, it may be possible to reduce the required com- 
puting time by using, instead of a multiplication, an addition, 
€.g., 
X; = (X;1+X;_,) mod 2’. 

In this method, the storage is required for n numbers but 
each new random number is obtained by a single addition. 
This usually represents a decrease in computing time; the 
exact amount will depend on the particular machine that 
is used. 

This paper discusses the additive method. The length 
of the period will depend on 7, r and the initial set of n 
numbers that are used. The period is sufficiently large 
for practical purposes. A number of tests were run on 
numbers generated by this process to determine their value 
as random numbers. The tests included the frequency test, 
the “ poker ”’ test, serial correlation test and the run test. 
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The process passes the frequency, poker and serial cor- 
relation tests but fails the run test for 7 less than 16. It 
passes the run test for 2 = 16 and presumably for n>16. 
If alternate numbers are discarded then the additive process 
passes the run test for m = 6 and presumably for larger n. 
The tests were conducted for r= 15 and 35 using the 
Lincoln Laboratory’s Memory Test computer and for 
y = 35 using an IBM 704. The authors conclude that 
this procedure can be used, provided the deficiency on the 
run test is taken into account either by dropping alternate 
numbers or by using 7 = 16. 


(D. Teichroew) 
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HEALY, M. J. (Rothamsted Exp. Station, England) 11.5 (9.0) 
The analysis of field experiments on an electronic computer—In English 
Biom. Zeit. (1959) 1, 210-214 (6 refs.) 


The paper deals with some experiences obtained in the 
analysis of field experiments by use of the Elliot/NRDC 401 
computer at Rothamsted. The computer is of medium size 
and has a storage capacity of 2944 words (of 32 “ bits” 
each). Input and output are on 5-hole punched paper tape. 
The first part of this paper treats the input, by means of a 
simple example and the author shows how to proceed in 
the case of a larger amount of input data and how to use 
the computer for preliminary computations. 

In the second part (Analysis) some programmes in use 
at Rothamsted are mentioned (Randomised Blocks, Latin 
Squares, and those for factorial experiments). To avoid 
“ overflow ’’, the author recommends the use of a working 
mean. In the case of a 2” factorial experiment, Hartley’s 
method of working with deviations from expected values 
turned out to be very useful. A valuable check consists in 
the computation of residuals and the residual sum of squares. 

The third part describes the output. The use of head- 
ings and the description of the analysed variables for the 
layout of the output data are recommended. At the end of 
the paper some special features are discussed. In the case 
of missing values, an efficient iterative method due to Healy 
and Westmacott is mentioned. 


(J. Roppert) 
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MORTARA, G. (University of Rome) 11.8 (2.1) 
Intensity index for international trade between two countries—In Italian 


Industria (1959) 17-26 (1 ref., 3 tables) 


In this article the author exposes the methodological problem 
of the calculation of an index of the relative intensity of 
the trade between two countries in relation to their world- 
trade. 

He presents many kinds of indexes calculated among the 
European Common Market countries. 

These indexes are ratios of the values of the trade 
between two countries to the values of the trade between 
the considered country and the other countries of the world, 
the value of the trade having been calculated with reference 
to the population. The above indexes are called unilateral 
indexes by the author and the reference to the population 
can be replaced by reference to other factors, e.g. national 
income or territory. 

With this procedure we have two comparative indexes 
one of them measuring the intensity of the trade of a country 
with reference to the trade of this country with all other 
considered countries. It is also possible to calculate bilateral 
indexes as the average of two unilateral indexes. The 
calculation of a bilateral index is made with the geometric 
mean because the unilateral indexes are ratios. 

Referring to bilateral indexes the author says that it is 
very difficult to identify them with a definite economic 
meaning. 


(P. Bandettini) 
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MOSHMAN, J. (Corporation for Economic and Industr. Res., Arlington, Va.) 
The application of sequential estimation to computer simulation and Monte Carlo 


procedures—In English 


11.7 (4.7) 


J. Ass. Comp. Mach, (1958) 5, 343-352 (10 refs., 1 fig.) 


This paper describes some of the sequential estimation 
techniques that could be used to terminate sampling when 
electronic computers are used for random sampling. In the 
past, very large samples have usually been required in order 
to obtain estimates with reasonably small variances. <A 
logical method of reducing average sample size is to consider 
a technique of sequential estimation to provide for continual 
machine sampling until a sufficiently large sample is accumu- 
iated so that one obtains an estimate with a predetermined 
variance, coefficient of variation or confidence interval. 

The methods presented here have all been used in the 
literature. They include the following: 


(i) Stein’s two-stage sampling technique [Ann. Math. 
Statist. (1945) 16, 243-258] can be used to estimate 
the mean of a normal distribution having ‘unknown 
variance; for a given initial sample of size No, a 
second sample of size N—WN, is obtained so that 
the confidence coefficient is at least 1—A (A being 
specified in advance). The present author has 
given an objective rule for choosing N, [Ann. Math. 
Statist. (1958) 29, 1271-1275]. Anscombe [¥%. R. 
Statist. Soc. B (1953) 15, 1-21] suggested a second- 
order asymptotic sequential procedure. 

(ii) Girshick, Mosteller and Savage [Ann. Math. Statist. 
(1946) 17, 13-23] reported on a sequential procedure 


YA 


MOTOO, M. (Institute of Statistical Mathematics, Tokyo) 


for estimating the parameter p of a binomial popula- 
tion. Stockman and Armitage [Suppl. #. R. Statist. 
Soc. (1946) 8, 104-112] have provided the means 
of using this procedure, which are feasible in, and 
only in, the framework of a computer for most 
practical problems. 

(iii) Anscombe [Biometrika (1949) 36, 455-458] suggests 
a procedure for constructing a boundary for a 
sequential process in which the mean 6, which has 
a variance V(@) which is a finite function of @, 
is estimated with fixed variance or coefficient of 
variation. 


An example is given to compare the Stein two-sample 
procedure with Anscombe’s sequential procedure. 


(D. Teichroew) 


Editorial Note: see abstract No. 105, 8.3. 
Moshman - 


11:70) 


Some evaluations for continuous Monte Carlo method by using Brownian hitting 


motions—In English 


Ann. Inst. Statist. Math., Tokyo (1959), 11, 49-54 (3 refs.) 


This paper treats the so-called 3-truncated spherical process 
[see also: Muller, Ann. Math. Statist. (1956) 27]. By this 
process, Muller has constructed an estimator of the solutions 
to Dirichlet’s problem. ' 

The main part of this paper is devoted to the evaluation 
of the upper bound of the mean square error of the estimator 
and that of the mean number of steps needed for this 
process. Assuming the boundary and the boundary 
function of Dirichlet’s problem are sufficiently smooth, 
these mean values can be replaced by the mean sojourn 
times of certain Brownian hitting motions which are given 
by a random changing of the time scale from Brownian 
motion. By evaluating these mean sojourn times, which 
are solutions of certain differential equations, the author 
obtains the desired upper bounds. 


(H. Akaike) 
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PANIZZON, G. (Institute of Statistics, University of Padua) 
Demand elasticity of some foodstuffs according to Italian experience—IJn Italian 


11.0 (-.-) 


Riv, Int, Sci. Econ, Comm. (1959) 6, 1148-1177 (26 refs., 10 figs.) 


The methodological interest of this article consists in the 
method used for the construction of the dynamic demand 
curve, based on market statistics. 

Budget equation 2 p,,; q:,, = ai(D1,; = prices; os 
quantities demanded; a; = income; t = time) is taken as 
a starting point. It is noted that this equation would be 
“solvable”’ if all prices and all quantities, referring to a 
given period of time, could be expressed in terms of one 
price and one quantity. 

This is maintained to be feasible whenever, over relatively 
short periods of time, the ratios between the price of one 
item to that of any other can be shown to be approximately 
constant. According to the author, this happens to be the 
case in Italy for the nine items considered during the 
periods 1891-1899 and 1950-1956. Actual prices are then 
substituted by new ones (D;, = calculated prices) on the 
basis of a strictly constant prices ratio. On the other hand, 
the theory of Pareto (on the standard of living) leads the 
author to some expressions in the form f,(q:,1) = 41,; 
associating the quantities of the various goods to the quantity 
of one item (e.g., 9,1). 
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SCHWARZ, K. (Statistisches Bundesamt, Wiesbaden) 


Index numbers in migration statistics—In German 


Alig. Statist. Arch. (1959) 43, 17-34 (4 refs., 3 tables) 


The author distinguishes three kinds of index numbers in 
migration statistics: 


(i) the mobility of a certain population, 
(ii) migration between two regions, or 
migration streams ” e 
(iii) the effect of migration on the size and composition 
of a population. 


In (i) he shows that the correct way to calculate rates 
of mobility is to divide the number of outward movements 
(from internal migration as well as emigration) by the 
number of the population “‘ exposed to the risk of migration.” 
If there is a considerable change in the size of the population 
during the period under review, the average number should 
be used. 

In order to measure the migration stream from one 
region (A) to another region (B), he proposes also to 
calculate the rate of mobility. The number of persons 
moving to (B) is to be divided by the number of people 
living in (A). This formula may be improved by taking 
into account the area of B. The author favours the formula: 


== 1GG00 as = Gn /E £)1000 


E.p 
where M = persons moving from (A) to (B) 
E = population of (A) 
p = area of (B) as a fraction of the total territory. 


“volumes of 
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? An equation is obtained from which known Pt,; and 
a, (being the interpolated values of income) together with 
the unknown value of q;,, (the reference item) are easily 
drawn: and with this all other values q;,;. 

Theoretical quantities, together with the new prices 
already calculated constitute the dynamic demand curve. 
Elasticity is measured by means of Schultz’s formulae 
[Statistical laws of demand and supply, (1928) Chicago, p. 12]. 


(O. Cucconi) 


11.8 (11.0) 


The author refers to other possibilities of measurement, 
which also take into account the size of population of (B). 
As an alternative he proposes to measure the “‘ attractive ”’ 
forces of a place (B) with respect to another place (A) by 
means of coefficients of association in contingency tables 
analogous to the concepts of social mobility. 

Considering (iii), the author briefly discusses rates which 
are based on net migration. Finally he describes the measure 
of effectiveness of migration, namely the fraction of net 
migration and the total number of externa! migrations. 


(K. Weichselberger) 
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TEDESCHI, B. (University of Rome) 


Concerning an index for measuring the change, over a period of time, 


of the mortality in a community—ZJn Italian 


G. Ist. Ital. Attuari (1958) 21, 13-19 (7 refs., 5 tables) 


The object of this work is to fix a suitable index for measuring 
the change in mortality over a period of time. 

Denoting by q(x, t) the death rate for a person of age x, 
belonging to a given community, within calendar year 1, 
the author observes that in order to compare the mortality 
of the persons of age x belonging to that community, within 
year t’, with the mortality of the persons of age x of that 
community, within year t”, any one of the following indexes 
is commonly used: 


T(x, t, t’) = q(x, t’)/q(x, t'); T(x, t’, t”) = q(x, t’)—a(x, t”); 
I,(x, t’, t”) = [9(x, t’)—q(x, t”)]/q(x, t’). 


Through the comparison of the values obtained by 
applying the said indexes to the death rates of the male 
population of Italy, over different periods of time, it is 
observed that the two indexes J, and I, may lead to contra- 
dictory results, whereas indexes J, and J, are not comparable 
because they are not homogeneous. The choice of the 
wrong index may lead to arbitrary conclusions. 

The author, referring by way of example to years 1930 
and 1950 and making successive decompositions among the 
causes which brought about changes in mortality between 
the said two years, then takes into consideration a group « 
of the causes which did not bring about any changes and 
group & of the causes which did bring about changes. 
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WARD, D. H. (National Coal Board, Manchester, England) 


11.8 (2.1) 


If g,(x, 1930) is the death rate for a person of age x 
connected with cause € alone in 1930, q,(x, 1930) is the 
analogous rate connected with cause « alone and q,,(x, 1930) 
the rate supplied by 1930 statistics, depending on causes « 
and ¢. Moreover, if in 1950 the death rate for a person of 
age x, q,(x, 1950), depends on cause « alone the result, 
considering qg,(x, 1930) as a pure probability of death 
(according to Karup), is 


Gae(x, 1930) —gy(x, 1950) 
1 —q, (x, 1950) : 


The index suggested by the author for the measurement 
of the change in mortality from 1930 to 1950 is I(x, 1930, 
1950) = q,(x, 1930), g,(x, 1930) being given by the foregoing 
expression. Such an index can obviously be applied with 
reference to two years whatever ?t’ and t”, replacing 1930 
and 1950 by ?@’ and ¢”. 

The values of this new index and of index J,, as defined 
above, resulted very similar for the ages corresponding 
with low death rates; whereas the higher the death rate 
G(x, 1950), the greater the difference between them. 


q(x, 1930) = 


(A. Del Chisco) 


11.3 (6.2) 


A nomogram for calculating partial correlation coefficients—In English 


Inc. Statist. (1959) 9, 139-141 (4 ref.) 


Presenting an alternative graphical method to that of Liddell 
[Inc. Statist. (1957) 7, 167-169], the author comments that 
although the indiscriminate use of the correlation coefficient 
is now out of fashion it is still of value to have a rapid 
method of approximate calculation for the partial correlation 
coefficient. The significance testing of sample (partial) 
regression coefficients is a particular example. 

The nomogram presented in this short paper deals with 
the graphical calculation of a partial correlation coefficient 
from three total correlation coefficients. By using a straight- 
forward method involving determinants, the author obtains 
an expression for the following: 


sin @ sin ¢ cos § cos 
sin 0 sin ¢—1’ sin 6 sin @—1 


(1, rp), (0, 7) and ( 


where rp = T12.3 = 7 cosec 8 cosec ¢—cot 6 cot $ 
Ue Seale 
and cos 0 = 743 
cos f = 123 


These points are collinear and enable the nomogram to be 
constructed. 


(W. R. Buckland) 
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